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Abstract 



Most existing natural language interfaces to databases (Nlidbs) were designed to be 
used with "snapshot" database systems, that provide very limited facilities for manip- 
ulating time-dependent data. Consequently, most Nlidbs also provide very limited 
support for the notion of time. In particular, they were designed to answer ques- 
tions that refer mainly to the present, and do not support adequately the mechanisms 
that natural language uses to express time. The database community is becoming 
increasingly interested in temporal database systems. These are intended to store and 
manipulate in a principled manner information not only about the present, but also 
about the past and future. When interfacing to temporal databases, it becomes crucial 
for Nlidbs to interpret correctly temporal linguistic mechanisms (verb tenses, temporal 
adverbials, temporal subordinate clauses, etc.) 

I argue that previous approaches to natural language interfaces for temporal databases 
(Nlitdbs) are problematic, mainly because they ignore important time-related linguis- 
tic phenomena, and/or they assume idiosyncratic temporal database systems. This 
thesis develops a principled framework for constructing English Nlitdbs, drawing on 
research in tense and aspect theories, temporal logics, and temporal databases. I first 
explore temporal linguistic phenomena that are likely to appear in English questions 
to Nlitdbs. Drawing on existing linguistic theories of time, I formulate an account for 
a large number of these phenomena that is simple enough to be embodied in practical 
Nlitdbs. Exploiting ideas from temporal logics, I then define a temporal meaning rep- 
resentation language, Top, and I show how the Hpsg grammar theory can be modified 
to incorporate the tense and aspect account of this thesis, and to map a wide range 
of English questions involving time to appropriate Top expressions. Finally, I present 
and prove the correctness of a method to translate from Top to Tsql2, Tsql2 being 
a temporal extension of the Sql-92 database language. This way, I establish a sound 
route from English questions involving time to a general-purpose temporal database 
language, that can act as a principled framework for building Nlitdbs. To demon- 
strate that this framework is workable, I employ it to develop a prototype Nlitdb, 
implemented using the Ale grammar development system and Prolog. The proto- 
type Nlitdb can map temporal questions from a non-trivial fragment of English to 
appropriate TSQL2 queries. 
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Chapter 1 



Introduction 



"No time like the present." 

1.1 Subject of this thesis 

Over the past thirty years, there have been significant advances in the area of natural 
language interfaces to databases (Nlidbs). Nlidbs allow users to access information 
stored in databases by typing requests expressed in natural language (e.g. English). 
(The reader is referred to [ Perrault fc Grosz 88 1, | Copestake Sparck Jones 90fl , and 



[|Androutsopoulos et al. 95b ] for surveys of Nlidbs.^]) Most of the existing Nlidbs 
were designed to interface to "snapshot" database systems, that provide very limited 
facilities for manipulating time-dependent data. Consequently, most Nlidbs also pro- 
vide very limited support for the notion of time. In particular, they were designed 



to answer questions that refer mainly to the present (e.g. (1.1) - ( |1.3| )), and do not 
support adequately the mechanisms that natural language uses to express time. For 
example, very few (if any) temporal adverbials ("in 1991", "after 5:00pm", etc.) and 
verb forms (simple past, past continuous, past perfect, etc.) are typically allowed, and 
their semantics are usually over-simplified or ignored. 

(1.1) What is the salary of each engineer? 

(1.2) Who is at site 4? 

(1.3) Which generators are in operation? 

1 The project described in this thesis began with an e xtensive survey of Nlidbs. The results of this 



survey were reported in [ Androutsopoulos et al. 95b | 
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The database community is becoming increasingly interested in temporal database 
systems. These are intended to store and manipulate in a principled manner in- 
formation not only about the present, but also about the past and the future (see 



[Tansel et al. 93] and [Jensen et al. 93] for an introduction, and [Bolour et al. 



| McKenzie 86 ], | Stam fc Snodgrass 88 ], | Soo 91 |, Kline 95 ], and | Tsotras fc Kumar 



for bibliographies). When interfacing to temporal databases, it becomes crucial for 
Nlidbs to interpret correctly the temporal linguistic mechanisms (verb tenses, tempo- 
ral adverbials, temporal subordinate clauses, etc.) of questions like (|1.4Q - (|l.6|) . 



(1.4) What was the salary of each engineer while ScotCorp was building bridge 5? 



(1.5) Did anybody leave site 4 before the chief engineer had inspected the control 
room? 

(1.6) Which systems did the chief engineer inspect on Monday after the auxiliary 
generator was in operation? 



In chapter |7|, I argue that previous approaches to natural language interfaces for tempo- 
ral databases (Nlitdbs) are problematic, mainly because they ignore important time- 
related linguistic phenomena, and/or they assume idiosyncratic temporal database 
systems. This thesis develops a principled framework for constructing English Nl- 
itdbs, drawing on research in linguistic theories of time, temporal logics, and temporal 
databases. 



1.2 Some background 

This section introduces some ideas from Nlidbs, linguistic theories of time, temporal 
logics, and temporal databases. Ideas from the four areas will be discussed further in 
following chapters. 



1.2.1 Natural language interfaces to databases 

Past work on Nlidbs has shown the benefits of using the abstract architecture of figure 
1.1 . The natural language question is first parsed and analysed semantically by a lin- 
guistic front-end, which translates the question into an intermediate meaning represen- 
tation language (typically, some form of logic) . The generated intermediate language 
expression captures formally what the system understands to be the meaning of the 
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Figure 1.1: Abstract architecture of many modern NLIDBs 



natural language question, without referring to particular database constructs. The 
intermediate language expression is then translated into a database language (usually 
SQL lUllman 88f jMelton fc Simon 93[ ) that is supported by the underlying database 
management system (DBMS; this is the part of the database system that manipulates 
the information in the database). The resulting database language expression specifies 
what information needs to be retrieved in terms of database constructs. The Dbms 
retrieves this information by evaluating the database language expression, and the 
obtained information is reported back to the user. 

Most Nlidbs can only handle questions referring to a particular knowledge-domain 
(e.g. questions about train departures, or about the employees of a company), and 
need to be configured before they can be used in a new domain. The configuration 
typically includes "teaching" the Nlidb words that can be used in the new domain, and 
linking basic expressions of the formal intermediate language to database constructs 
(see section 6 of [ Androutsopoulos et al. 95b|| ). 



The architecture of figure LI has proven to have several advantages (see sections 5.4 
and 6 of |Androutsopoulos et al. 95b| ), like modularity (e.g. the linguistic- front end is 
shielded from database-level issues), and Dbms portability (the same linguistic front- 
end can be used with Dbmss that support different database languages). This thesis 
examines how this architecture can be used to construct Nlitdbs. 



1.2.2 Tense and aspect theories 

In English, temporal information can be conveyed by verb forms (simple past, past con- 
tinuous, present perfect, etc.), nouns ("beginning", "predecessor", "day"), adjectives 
("earliest", "next", "annual"), adverbs ("yesterday", "twice"), prepositional phrases 
("at 5:00pm", "for two hours"), and subordinate clauses ("while tank 4 was empty"), 
to mention just some of the available temporal mechanisms. A linguistic theory of 
time must account for the ways in which these mechanisms are used (e.g. specify what 
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is the temporal content of each verb form, how temporal adverbials or subordinate 
clauses affect the meaning of the overall sentences, etc.). The term "tense and aspect 
theories" is often used in the literature to refer to theories of this kind. (The precise 
meanings of "tense" and "aspect" vary from one theory to the other; see JC omrie "76 ] 



and [ Comrie 85 1 for some discussion. Consult chapter 5 of [ Kamp fc Reyle 93 1 for an 



extensive introduction to tense and aspect phenomena.) 

It is common practice in tense and aspect theories to classify natural language expres- 
sions or situations described by natural language expressions into aspectual classes. 
(The term "AJctionsarten" is often used to refer to these classes.) Many aspectual 



classifications are similar to Vendler's taxonomy | Vendler 67 1, that distinguishes be- 
tween state verbs, activity verbs, accomplishment verbs, and achievement verbsj^j For 
example, "to run" (as in "John ran. 7 ') is said to be an activity verb, "to know" (as in 
"John knows the answer.") a state verb, "to build" (as in "John built a house.") an 
accomplishment verb, and "to End" (as in "Mary found the treasure.") an achievement 
verb. 

Vendler's intuition seems to be that activity verbs describe actions or changes in the 
world. For example, in "John ran." there is a running action in the world. In contrast, 
state verbs do not refer to any actions or changes. In "John knows the answer." there 
is no change or action in the world. Accomplishment verbs are similar to activity verbs, 
in that they denote changes or actions. In the case of accomplishment verbs, however, 
the action or change has an inherent "climax" , a point that has to be reached for the 
action or change to be considered complete. In "build a house" the climax is the point 
where the whole of the house has been built. If the building stops before the whole of 
the house has been built, the building action is incomplete. In contrast, the action of 
the activity verb "to run" (with no object, as in "John ran.") does not seem to have 
any climax. The runner can stop at any time without the running being any more or 
less complete. If, however, "to run" is used with an object denoting a precise distance 
(e.g. "to run a mile"), then the action does have a climax: the point where the runner 
completes the distance. In this case, "to run" is an accomplishment verb. Finally, 
achievement verbs, like "to End", describe instantaneous events. In "Mary found the 
treasure." the actual finding is instantaneous (according to Vendler, the time during 



According; to Mourelatos Mourelatos 76], a similar taxonomy was developed independently in 



Kenny 63[, where Kenny notes that his classification is similar to the distinction between kine- 



seis and energiai introduced by Aristotle in Metaphysics, 0. 10486, 18-36. 
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which Mary was searching for the treasure is not part of the actual finding). In contrast, 
in "John built a house." (accomplishment verb) the actual building action may have 
lasted many years. 

Aspectual taxonomies are invoked to account for semantic differences in similar sen- 
tences. The so-called "imperfective paradox" [ Dowty 77| jLascarides 88 ] is a well- 



known example (various versions of the imperfective paradox have been proposed; see 



| Kent 93 ]). The paradox is that if the answer to a question like ( |l.7|) is affirmative, then 
the answer to the non-progressive (|l.8| ) must also be affirmative. In contrast, an affir- 
mative answer to (|1.9| ) does not necessarily imply an affirmative answer to (|l~10| ) (John 
may have abandoned the repair before completing it). The Nlitdb must incorporate 
some account for this phenomenon. If the Nlitdb generates an affirmative response to 
(|1.7| ), there must be some mechanism to guarantee that the Nlitdb's answer to ( |1.8| ) 
will also be affirmative. No such mechanism is needed in (|1.9| ) and ( |1.10|) . 

(1.7) Was IBI ever advertising a new computer? 

(1.8) Did IBI ever advertise a new computer? 

(1.9) Was J.Adams ever repairing engine 2? 

(1.10) Did J.Adams ever repair engine 2? 



The difference between ([□]) - ([f|) and Q - ( |i~To|) can be accounted for by classify- 



ing "to advertise" as an activity, "to repair" as an accomplishment, and by stipulating 
that: (i) the simple past of an accomplishment requires the climax to have been reached; 
(ii) the past continuous of an accomplishment or activity, and the simple past of an 
activity impose no such requirement. Then, the fact that an affirmative answer to ( |1.9| ) 



does not necessarily imply an affirmative answer to ( 1.10 ) is accounted for by the fact 



that ( 1.10 ) requires the repair to have been completed, while (|1.9|) merely requires the 
repair to have been ongoing at some past time. In contrast (|0^) does not require any 
climax to have been reached; like ( |1.7| ), it simply requires the advertising to have been 



ongoing at some past time. Hence, an affirmative answer to (1.7) implies an affirmative 
answer to (|1.8;), It will become clear in chapter [2] that aspectual taxonomies pertain 
to the semantics of almost all temporal linguistic mechanisms. 
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Time is an important research topic in logic, and many formal languages have been 
proposed to express temporal information [ vanBenthem 83| | [Gabbay et al. 94]. One of 



the simplest approaches is to use the traditional first-order predicate logic, introducing 
time as an extra argument of each predicate. ( l.ll|) would be represented as ( 1.12| ), 



where t is a time-denoting variable, -< stands for temporal precedence, C for temporal 
inclusion, and now is a special term denoting the present moment. The answer to 
( 1.11 ) would be affirmative iff (|l,12| ) evaluates to true, i.e. iff there is a time t, such 



that t precedes the present moment, t falls within 1/10/95, and tank 2 contained water 
at t. (Throughout this thesis, I use "iff" as a shorthand for "if and only if".) 

(1.11) Did tank 2 contain water (some time) on 1/10/95? 

(1.12) 3t contain(tank2, water, t) At -< now At C 1/10/95 

An alternative approach is to employ temporal operators, like Prior's P (past) and F 



(future) | Prior 67 ]. In that approach, formulae are evaluated with respect to particular 
times. For example, contain(tank2, water) would be true at a time t iff tank 2 con- 
tained water at t. Assuming that (p is a formula, P<fi is true at a time t iff there is a time 
t', such that t' precedes t, and (f) is true at t' . Similarly, F<f> is true at t iff there is a t', 
such that t' follows t, and (j> is true at t' . "Tank 2 contains water." can be expressed as 
contain(tank2, water), "Tank 2 contained water." as P contain{tank2, water), "Tank 
2 will contain water." as F contain(tank2, water), and "Tank 2 will have contained 
water." as F P contain{tank2, water). Additional operators can be introduced, to 
capture the semantics of temporal adverbials, temporal subordinate clauses, etc. For 
example, an On operator could be introduced, with the following semantics: if <p is a 
formula and k specifies a day (e.g. the day 1/10/95), then On[n, <ft] is true at a time 
t iff t falls within the day specified by k, and <p is true at t. Then, ( [1.11 ) could be 
represented as ( 1.13 ). 



(1.13) P On[l / 10 / 95 , contains(tank2, water)} 

The intermediate representation language of this thesis, called Top, adopts the opera- 
tors approach (Top stands for "language with Temporal OPerators"). Temporal oper- 



ators have also been used in Dowty 82 ], [ Lascarides 88 1, [ Richards et al. 89 ], [Kent 



|Crouch fc Pulman 93 1, Pratt fc Bree 95| , and elsewhere. 
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Unlike logics designed to be used in systems that reason about what changes or 
remains the same over time, what can or will happen, what could or would have 
happened, or how newly arrived information fits within already known facts or as- 
sumptions (e.g. the situation calculus of [McCarthy & Hayes 6£], the event calculus of 
[ Kowalski fc Sergot 86(1 , and the logics of flAllen 8$1 , jAllen 84fl , and flMcDermott 82 ] 



- see [ Vila 94 ] for a survey), Top is not intended to be used in reasoning. I provide 
no inference rules for Top, and this is why I avoid calling Top a logic. Top is only 
a formal language, designed to facilitate the systematic mapping of temporal English 
questions to formal expressions (this mapping is not a primary consideration in the 
above mentioned logics). The answers to the English questions are not generated by 
carrying out reasoning in Top, but by translating the Top expressions to database 
language expressions, which are then evaluated by the underlying Dbms. The defini- 
tion of Top will be given in chapter ||, where other ideas from temporal logics will also 
be discussed. 



1.2.4 Temporal databases 



In the relational model [ podd 70| , currently the dominant database model, information 
is stored in relations. Intuitively, relations can be thought of as tables, consisting of 
rows (called tuples) and columns (called attributes). For example, the salaries relation 
below shows the present salaries of the current employees of a company. In the case 
of salaries, whenever the salary of an employee is changed, or whenever an employee 
leaves the company, the corresponding tuple is modified or deleted. Hence, the database 
"forgets" past facts, and does not contain enough information to answer questions like 
"What was the salary of T.Smith on 1/1/1992?". 



salaries 


employee 


salary 


J.Adams 
T.Smith 


17000 
19000 



It is certainly true that traditional database models and languages can and have been 
used to store temporal information. (This has led several researchers to question the 
need for special temporal support in database systems; see [ pavies et al. 95 ] for some 
discussion.) For example, two extra attributes (from and to) could be added to salaries 
(as in salaries2) to time-stamp its tuples, i.e. to show when each employee had the 
corresponding salary. 
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salaries2 


employee 


salary 


from 


to 


J.Adams 


17000 


1/1/88 


5/5/90 


J.Adams 


18000 


6/5/90 


9/8/91 


J.Adams 


21000 


10/8/91 


27/3/93 


T.Smith 


17000 


1/1/89 


1/10/90 


T.Smith 


21000 


2/10/90 


23/5/92 



The lack of special temporal support in traditional database models and languages, 
however, complicates the task of expressing in database language time-related data 
manipulations. We may want, for example, to compute from salaries'! a new relation 
same_salaries that shows the times when J.Adams and T.Smith had the same salary, 
along with their common salary: 



samesalaries 


salary 


from 


to 


17000 
21000 


1/1/89 
10/8/91 


5/5/90 
23/5/92 



That is, for every tuple of J.Adams in salaries!, we need to check if the period specified 
by the from and to values of that tuple overlaps the period specified by the from and 
to values of a tuple for T.Smith which has the same salary value. If they overlap, we 
need to compute the intersection of the two periods. This cannot be achieved easily in 
the present version of Sql (the dominant database language for relational databases 
| Unman 8§ ] | Melton fc Simon 9q ]), because Sql currently does not have any special 
commands to check if two periods overlap, or to compute the intersection of two periods 
(in fact, it does not even have a period datatype). 

As a further example, the approach of adding a from and a to attribute to every 
relation allows relations like rell and rel2 to be formed. Although rell and rel2 
contain different tuples, they represent the same information. 



rell 




rell 


employee 


salary 


from 


to 


employee 


salary 


from 


to 


G.Foot 


17000 


1/1/88 


9/5/88 




G.Foot 


17000 


1/1/88 


31/5/89 


G.Foot 


17000 


10/5/88 


9/5/93 




G.Foot 


17000 


1/6/89 


10/8/92 


G.Foot 


18000 


10/5/93 


1/3/94 




G.Foot 


17000 


11/8/92 


9/5/93 


G.Foot 


18000 


2/3/94 


11/2/95 




G.Foot 


18000 


10/5/93 


11/2/95 


G.Foot 


17000 


12/2/95 


31/3/96 




G.Foot 


17000 


12/2/95 


31/3/96 
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Checking if the two relations represent the same information is not easy in the current 
Sql version. This task would be greatly simplified if Sql provided some mechanism 
to "normalise" relations, by merging tuples that apart from their from and to values 
are identical (tuples of this kind are called value-equivalent). In our example, that 
mechanism would turn both rell and rel2 into re/3. To check that rell and veil 
contain the same information, one would check that the normalised forms of the two 
relations are the same. 



reli 


employee 


salary 


from 


to 


G.Foot 
G.Foot 
G.Foot 


17000 
18000 
17000 


1/1/88 
10/5/93 
12/2/95 


9/5/93 
11/2/96 
31/3/96 



Numerous temporal versions of Sql and the relational model have been proposed (e.g. 
flClifford & Warren 83|| , [|Ariav 86| , jTansel 86j 1 , [ fcnodgrass 871 , jNavathe fe Ahmed 88fl , 
flGadia 88 ], | Lorentzos fc Johnson 88 1; see | McKenzie &: Snodgrass 9l[| for a summary 
of some of the proposals). These add special temporal facilities to Sql (e.g. predicates 
to check if two periods overlap, functions to compute intersections of periods, etc.), and 
often special types of relations to store time-varying information (e.g. relations that 
force value-equivalent tuples to be merged automatically). Until recently there was 
little consensus on how temporal support should be added to Sql and the relational 
model (or other database languages and models), with every researcher in the field 
adopting his/her own temporal database language and model. Perhaps as a result 
of this, very few temporal Dbmss have been implemented (these are mostly early 
prototypes; see poehlen~95" [). 



This thesis adopts Tsql2, a recently proposed temporal extension of Sql-92 that 
was designed by a committee comprising most leading temporal database researchers. 
(Sql-92 is the latest Sql standard [Melton fc Simon~95]. Tsql2 is defined in [Snodgrass 95] 



An earlier definition of Tsql2 can be found in [ Snodgrass et al. 94a |.) Tsql2 and the 
version of the relational model on which Tsql2 is based will be presented in chapter 
||, along with some modifications that were introduced to them for the purposes of this 
thesis. Until recently, there was no implemented DBMS supporting Tsql2. A proto- 
type system, however, which is capable of evaluating Tsql2 queries now exists. (This 



system is called TimeDB. See [ Boehlen 95 [ for a brief technical description of TimeDB. 



TimeDB actually supports Atsql2, a variant of Tsql2. See [[Boehlen et al. 96| for 
some information on ATSQL2.) 
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Researchers in temporal databases distinguish between valid time and transaction 
time^\ The valid time of some information is the time when that information was 
true in the world. The transaction time of some information is the time when the 
database "believed" some piece of information. In this thesis, I ignore the transaction- 
time dimension. I assume that the natural language questions will always refer to 
the information that the database currently believes to be true. Questions like ( 1.14| ), 



where "on 2/1/95" specifies a transaction time other than the present, will not be 
considered. 

(1.14) According to what the database believed on 2/1/95, what was the salary of 
J.Adams on 1/1/89? 

1.3 Contribution of this thesis 



As mentioned in section 1.1, most existing Nlidbs were designed to interface to snap- 
shot database systems. Although there have been some proposals on how to build 
Nlidbs for temporal databases, in chapter ^ I argue that these proposals suffer from 
one or more of the following: (i) they ignore important English temporal mechanisms, 
or assign to them over-simplified semantics, (ii) they lack clearly defined meaning rep- 
resentation languages, (iii) they do not provide complete descriptions of the mappings 
from natural language to meaning representation language, or (iv) from meaning rep- 
resentation language to database language, (v) they adopt idiosyncratic and often not 
well-defined temporal database models or languages, (vi) they do not demonstrate 
that their ideas are implementable. In this thesis, I develop a principled framework for 
constructing English Nlitdbs, attempting to avoid pitfalls (i) - (vi). Building on the 



architecture of figure 1.1 



I explore temporal linguistic phenomena that are likely to appear in English 
questions to Nlitdbs. Drawing on existing linguistic theories of time, I formulate 
an account for many of these phenomena that is simple enough to be embodied 
in practical Nlitdbs. 

Exploiting ideas from temporal logics, I define a temporal meaning representation 
language (Top), which I use to represent the semantics of English questions. 



3 



I adopt the consensus terminology of Jensen et al. 93 1. A third term, user-defined time, is also 



employed in the literature to refer to temporal information that is stored in the database without 
the DBMS treating it in any special way. 
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I show how Hpsg pollard k. Sag 87f pollard & Sag 94j , currently a highly re 



garded linguistic theory, can be modified to incorporate the tense and aspect 
account of this thesis, and to map a wide range of English questions involving 
time to appropriate Top expressions. 

• I present and prove the correctness of a mapping that translates Top expressions 
to Tsql2 queries. 

This way, I establish a sound route from English questions involving time to a general- 
purpose temporal database language, that can act as a principled framework for con- 
structing Nlitdbs. To ensure that this framework is workable: 

• I demonstrate how it can be employed to implement a prototype Nlitdb, us- 



ing the Ale grammar development system | Carpenter 92 1 [ Carpenter fc Penn 94 1 



and Prolog Clocksin fc Mellish 94 ] |Sterling fc Shapiro 94 1. I configure the pro- 



totype Nlitdb for a hypothetical air traffic control domain, similar to that of 



[ Sripada et al. 94 } 



Unfortunately, during most of the work of this thesis no Dbms supported Tsql2. 



As mentioned in section 1.2.4, a prototype Dbms (TimeDB) that supports a version 
of Tsql2 (Atsql2) was announced recently. Although it would be obviously very 
interesting to link the Nlitdb of this thesis to TimeDB, there is currently very little 
documentation on TimeDB. The task of linking the two systems is further complicated 
by the fact that both adopt their own versions of Tsql2 (TimeDB supports Atsql2, 
and the Nlitdb of this thesis adopts a slightly modified version of Tsql2, to be 
discussed in chapter JBJ). One would have to bridge the differences between the two 
Tsql2 versions. Due to shortage of time, I made no attempt to link the Nlitdb of 
this thesis to TimeDB. The Tsql2 queries generated by the Nlitdb are currently not 
executed, and hence no answers are produced. 

Although several issues (summarised in section 8.2) remain to be addressed, I am con- 



fident that this thesis will prove valuable to both those wishing to implement Nlitdbs 
for practical applications, and those wishing to carry out further research on Nlitdbs, 
because: (a) it is essentially the first in-depth exploration of time-related problems the 
Nlitdb designer has to face, from the linguistic level down to the database level, (b) 
it proposes a clearly defined framework for building Nlitdbs that addresses a great 



CHAPTER 1. INTRODUCTION 12 

number of these problems, and (c) it shows how this framework was used to implement 
a prototype Nlitdb on which more elaborate Nlitdbs can be based. 

Finally, I note that: (i) the work of this thesis is one of the first to use Tsql2, and one 
of the first to generate feedback to the Tsql2 designers (a number of obscure points 
and possible improvements in the definition of Tsql2 were revealed during this project; 
these were reported in [ Androutsopoulos et al. 95a[| ); (ii) the prototype Nlitdb of this 



thesis is currently one of the very few Nlidbs (at least among Nlidbs whose grammar 
is publicly documented) that adopt HPSG.Q 

1.4 Issues that will not be addressed 

To allow the work of this thesis to be completed within the available time, the following 
issues were not considered. 

Updates: This thesis focuses on questions. Natural language requests to update 
the database (e.g. ( 1.1E| )) are not considered (see [ Davidson Kaplan 83 ] for work on 



natural language updates.) 

(1.15) Replace the salary of T.Smith for the period 1/1/88 to 5/5/90 by 17000. 



Assertions like ( |1.16| ) will be treated as yes/no questions, i.e. ( 1.16| ) will be treated in 
the same way as ( |1.17|) . 

(1.16) On 1/1/89 the salary of T.Smith was 17000. 

(1.17) Was the salary of T.Smith 17000 on 1/1/89? 

Schema evolution: This term refers to cases where the structure, not only the 
contents, of the database change over time (new relations are created, old deleted, 
attributes are added or removed from relations, etc.; see [ McKcnzic fc Snodgrass 90| ). 



Schema evolution is not considered in this thesis. The structure of the database is 
assumed to be static, although the information in the database may change over time. 



4 



mechanisms, was used in 



See also [Cercone et al. 93 ]. A version of the Hpsg grammar of this thesis, stripped of its temporal 



jeldrup 95| to construct a Nlidb for snapshot databases. 
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Modal questions: Modal questions ask if something could have happened, or could 
never have happened, or will necessarily happen, or can possibly happen. For example, 
"Could T.Smith have been an employee of IBI in 1985?" does not ask if T.Smith was 
an IBI employee in 1985, but if it would have been possible for T.Smith to be an IBI 



employee at that time. Modal questions are not examined in this thesis (see [ Mays 86 1 



and |Lowden et gl. 91b | for related work). 



Future questions: A temporal database may contain predictions about the future. 
At some company, for example, it may have been decided that T.Smith will retire two 
years from the present, and that J.Adams will replace him. These decisions may have 
been recorded in the company's database. In that context, one may want to submit 
questions referring to the future, like "When will T.Smith retire?" or "Who will replace 
T.Smith?". To simplify the linguistic data that the work of this thesis had to address, 
future questions were not considered. The database may contain information about 
the future, but the framework of this thesis does not currently allow this information 
to be accessed through natural language. Further work could extend the framework of 
this thesis to handle future questions as well (see section |8.2| ) . 



Cooperative responses: In many cases, it is helpful for the user if the Nlidb 
reports more information than what the question literally asks for. In the dialogue 
below (from | Johnson 85| ), for example, the system has reasoned that the user would 



be interested to know about the United flight, and has included information about that 
flight in its answer although this was not requested. 

(1.18) Do American Airlines have a night flight to Dallas? 

(1.19) No, but United have flight 655. 



In other cases, the user's requests may be based on false presumptions. ( 1.20| ), for 



example, presumes that there is a flight called BA737. If this is not true, it would be 
useful if the Nlidb could generate a response like ( |1.21| ). 

(1.20) Does flight BA737 depart at 5:00pm? 

(1.21) Flight BA737 does not exist. 



The term cooperative responses Kaplan 82 ] is used to refer to responses like ( 1.19 ) and 
( 1.21| ). The framework of this thesis includes no mechanism to generate cooperative 
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responses. During the work of this thesis, however, it became clear that such a mech- 
anism is particularly important in questions to Nlitdbs, and hence a mechanism of 



this kind should be added (this will be discussed further in section 3.2) 



Anaphora: Pronouns ("she", "they", etc.), possessive determiners ("his", "their"), 
and some noun phrases ( "the project", "these people") are used anaphorically, to refer 
to contextually salient entities. The term nominal anaphora is frequently used to refer 
to this phenomenon (see [Hirst 81] for an overview of nominal anaphora, and [Hobbs 86] 



for methods that can be used to resolve pronoun anaphora). Verb tenses and other 
temporal expressions (e.g. "on Monday") are often used in a similar anaphoric manner 
to refer to contextually salient times (this will be discussed in section |2.12 ). The term 



temporal anaphora [Partee 84] is used in that case. Apart from a temporal anaphoric 
phenomenon related to noun phrases like "the sales manager" (to be discussed in 
section [2.11 ), for which support is provided, the framework of this thesis currently 
provides no mechanism to resolve anaphoric expressions (i.e. to determine the entities or 
times these expressions refer to). Words introducing nominal anaphora (e.g. pronouns) 



are not allowed, and (excluding the phenomenon of section 2.11 ) temporal anaphoric 
expressions are treated as denoting any possible referent (e.g. "on Monday" is taken 
to refer to any Monday). 



Elliptical sentences: Some Nlidbs allow elliptical questions to be submitted as 
follow-ups to previous questions (e.g. "What is the salary of J.Adams?" , followed by 
"His address?"; see section 4.6 of [ Androutsopoulos et ol. 95b| ] for more examples). 
Elliptical questions are not considered in this thesis. 



1.5 Outline of the remainder of this thesis 



The remainder of this thesis is organised as follows: 

Chapter 2 explores English temporal mechanisms, delineating the set of linguistic phe- 
nomena that this thesis attempts to support. Drawing on existing ideas from tense 
and aspect theories, an account for these phenomena is formulated that is suitable to 
the purposes of this thesis. 

Chapter 3 defines formally Top, discussing how it can be used to represent the se- 
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mantics of temporal English expressions, and how it relates to other existing temporal 
representation languages. 

Chapter 4 provides a brief introduction to Hpsg, and discusses how Hpsg can be 
modified to incorporate the tense and aspect account of this thesis, and to map English 
questions involving time to appropriate Top expressions. 

Chapter 5 defines the mapping from Top to Tsql2, and proves its correctness (parts 
of this proof are given in appendix [A]). It also discusses the modifications to Tsql2 
that are adopted in this thesis. 

Chapter 6 describes the architecture of the prototype Nlitdb, provides information 
about its implementation, and explains which additional modules would have to be 
added if the system were to be used in real-life applications. Several sample English 
questions directed to a hypothetical temporal database of an airport are shown, dis- 
cussing the corresponding output of the prototype Nlitdb. 

Chapter 7 discusses previous proposals in the area of Nlitdbs, comparing them to the 
framework of this thesis. 

Chapter 8 summarises and proposes directions for further research. 



Chapter 2 



The Linguistic Data and an 
Informal Account 

"There is a time for everything.'''' 

2.1 Introduction 

This chapter explores how temporal information is conveyed in English, focusing on 
phenomena that are relevant to Nlitdbs. There is a wealth of temporal English 
mechanisms (e.g. verb tenses, temporal adverbials, temporal adjectives, etc.), and it 
would be impossible to consider all of those in this thesis. Hence, several English 
temporal mechanisms will be ignored, and simplifying assumptions will be introduced 
in some of the mechanisms that will be considered. One of the goals of this chapter is 
to specify exactly which linguistic phenomena this thesis attempts to support. For the 
phenomena that will be supported, a further goal is to provide an informal account of 
how they will be treated. 

Although this chapter draws on existing tense and aspect theories, I stress that it is 
in no way an attempt to formulate an improved tense and aspect theory. The aim is 
more practical: to explore how ideas from existing tense and aspect theories can be 
integrated into Nlitdbs, in a way that leads to provably implementable systems. 
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2.2 Aspectual taxonomies 



As mentioned in section 1.2.2 , many tense and aspect theories employ aspectual clas- 
sifications, which are often similar to Vendler's distinction between states (e.g. "to 
know", as in "John knows the answer."), activities (e.g. "to run", as in "John ran."), 
accomplishments (e.g. "to build", as in "John built a house."), and achievements (e.g. 
"to find", as in "Mary found the treasure."). 

Vendler proposes a number of linguistic tests to determine the aspectual classes of 
verbs. For example, according to Vendler, activity and accomplishment verbs can ap- 
pear in the progressive (e.g. "John is running", "John is building a house"), while state 
and achievement verbs cannot (* "John is knowing the answer.", * "Mary is finding the 
treasure"). Activity verbs are said to combine felicitously with "for ... " adverbials 
specifying duration ("John ran for two minutes."), but sound odd with "in ... " du- 
ration adverbials (? "John ran in two minutes."). Accomplishment verbs, in contrast, 
combine felicitously with "in ..." adverbials ( "John built a house in two weeks."), but 
sound odd with "for ... " adverbials (? "John built a house for two weeks."). Finally, 
according to Vendler state verbs combine felicitously with "for ... " adverbials (e.g. 
"John knew the answer for ten minutes (but then forgot it)."), while achievement verbs 
sound odd with "for ... " adverbials (? "Mary found the treasure for two hours."). 

The exact nature of the objects classified by Vendler is unclear. In most cases, Vendler's 
wording suggests that his taxonomy classifies verbs. However, some of his examples 
(e.g. the fact that "to run" with no object is said to be an activity, while "to run a mile" 
is said to be an accomplishment) suggest that the natural language expressions being 
classified are not always verbs, but sometimes larger syntactic constituents (perhaps 
verb phrases). In other cases, Vendler's arguments suggest that the objects being 
classified are not natural language expressions (e.g. verbs, verb phrases), but world 
situations denoted by natural language expressions. According to Vendler, "Are you 
smoking?" "asks about an activity", while "Do you smoke?" "asks about a state". In 
this case, the terms "activity" and "state" seem to refer to types of situations in the 
world, rather than types of natural language expressions. (The first question probably 
asks if somebody is actually smoking at the present moment. The second one has a 
habitual meaning: it asks if somebody has the habit of smoking. Vendler concludes 
that habits "are also states in our sense".) 
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Numerous variants of Vendler's taxonomy have been proposed. These differ in the 
number of aspectual classes they assume, the names of the classes, the nature of the ob- 



jects being classified, and the properties assigned to each class. Vlach [ Vlach 93| distin- 



guishes four aspectual classes of sentences, and assumes that there is a parallel fourfold 



taxonomy of world situations. Moens Moens 87 ] distinguishes between "states", "pro- 



; 'culminated processes", "culminations", and "points", commenting that his 
taxonomy does not classify real world situations, but ways people use to describe world 



situations. Parsons flParsons 89J distinguishes three kinds of "eventualities" ("states", 



"activities", and "events"), treating eventualities as entities in the world. Lascarides 



| Lascarides 88 ] classifies propositions (functions from time-periods to truth values), 



distinguishing between "state", "process", and "event" propositions. 

2.3 The aspectual taxonomy of this thesis 

Four aspectual classes are employed in this thesis: states, activities, culminating activ- 
ities, and points. (Culminating activities and points correspond to Vendler's "accom- 



plishments" and "achievements" respectively. Similar terms are used in [ Moens 87 1 and 



flBlackb urn et al. 94| .) These aspectual classes correspond to ways of viewing world sit- 



uations that people seem to use: a situation can be viewed as involving no change or 
action (state view), as an instantaneous change or action (point view), as a change or 
action with no climax (activity view), or as a change or action with a climax (culmi- 
nating activity view). (Throughout this thesis, I use "situation" to refer collectively 
to elements of the world that other authors call "events", "processes", "states", etc.) 
Determining which view the speaker has in mind is important to understand what the 
speaker means. For example, "Which tanks contained oil?" is typically uttered with 
a state view. When an "at ... " temporal adverbial (e.g. "at 5:00pm") is attached to 
a clause uttered with a state view, the speaker typically means that the situation of 
the clause simply holds at the time of the adverbial. There is normally no implication 
that the situation starts or stops holding at the time of the adverbial. For example, 
in "Which tanks contained oil at 5:00pm?" there is normally no implication that the 
tanks must have started or stopped containing oil at 5:00pm. In contrast, "Who ran 
to the station?" is typically uttered with a culminating activity view. In this case, an 
"at ..." adverbial usually specifies the time when the situation starts or is completed. 
"Who ran to the station at 5:00pm?", for example, probably asks for somebody who 
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started running to the station or reached it at 5:00pm. 

Some linguistic markers seem to signal which view the speaker has in mind. For 
example, the progressive usually signals a state view (e.g. unlike "Who ran to the 
station at 5:00pm?", "Who was running to the station at 5:00pm?" is typically uttered 
with a state view; in this case, the running is simply ongoing at 5:00pm, it does not 
start or finish at 5:00pm). Often, however, there are no such explicit markers. The 
processes employed in those cases by hearers to determine the speaker's view are not yet 
fully understood. In an Nlitdb, however, where questions refer to a restricted domain, 
reasonable guesses can be made by observing that in each domain, each verb tends to be 
associated mainly with one particular view. Certain agreements about how situations 
are to be viewed (e.g. that some situations are to be treated as instantaneous - point 
view) will also have been made during the design of the database. These agreements 
provide additional information about how the situations of the various verbs are viewed 
in each domain. 

More precisely, the following approach is adopted in this thesis. Whenever the Nlitdb 
is configured for a new application domain, the base form of each verb is assigned to one 



of the four aspectual classes, using criteria to be discussed in section 2.4. These criteria 



are intended to detect the view that is mainly associated with each verb in the particu- 



lar domain that is being examined. Following [ Dowty 86 1, [ Moens 87 ], [ Vlach 93 ], and 
others, aspectual class is treated as a property of not only verbs, but also verb phrases, 
clauses, and sentences. Normally, all verb forms will inherit the aspectual classes of 
the corresponding base forms. Verb phrases, clauses, or sentences will normally inherit 
the aspectual classes of their main verb forms. Some linguistic mechanisms (e.g. the 
progressive or some temporal adverbials), however, may cause the aspectual class of a 
verb form to differ from that of the base form, or the aspectual class of a verb phrase, 
clause, or sentence to differ from that of its main verb form. The aspectual class of 
each verb phrase, clause, or sentence is intended to reflect the view that users typically 
have in mind when using that expression in the particular domain. 

In the case of a verb like "to run", that typically involves a culminating activity view 
when used with an expression that specifies a destination or specific distance (e.g. "to 
run to the station/Eve miles"), but an activity view when used on its own, it will be 
assumed that there are two different homonymous verbs "to run". One has a culmi- 
nating activity base form, and requires a complement that specifies a destination or 
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specific distance. The other has an activity base form, and requires no such comple- 
ment. A similar distinction would be introduced in the case of verbs whose aspectual 
class depends on whether or not the verb's object denotes a countable or mass entity 



(e.g. "to drink a bottle of wine" vs. "to drink wine"; see ( Mourelatos 78|). 



Similarly, when a verb can be used in a domain with both habitual and non-habitual 
meanings (e.g. "BA737 (habitually) departs from Gatwick." vs. "BA737 (actually) 
departed from Gatwick five minutes ago."), a distinction will be made between a 
homonym with a habitual meaning, and a homonym with a non-habitual meaning.^ 
The base forms of habitual homonyms are classified as states. (This agrees with 
Vendler, Vlach [ Vlach 9^| , and Moens and Steedman | Moens fc Steedman 88 ], who all 



classify habituals as states.) The aspectual classes of non-habitual homonyms depend 
on the verb and the application domain. Approaches that do not postulate homonyms 
are also possible (e.g. claiming that "to run" is an activity which is transformed into 
a culminating activity by "the station"). The homonyms method, however, leads to a 
more straight forward treatment in the Hpsg grammar of chapter ^| (where the base 
form of each homonym is mapped to a different sign) . 

In the rest of this thesis, I refer to verbs whose base forms are classified as states, 
activities, culminating activities, or points as state verbs, activity verbs, culminating 
activity verbs, and point verbs. 



2.4 Criteria for classifying base verb forms 



This section discusses the criteria that determine the aspectual class of a verb's base 
form in a particular Nlitdb domain. Three criteria are employed, and they are applied 



in the order of figure 2.1 



2.4.1 The simple present criterion 

The first criterion distinguishes state verbs (verbs whose base forms are states) from 
point, activity, and culminating activity verbs. If the simple present of a verb can be 
used (in the particular domain) in single-clause questions with non-futurate meanings, 
the verb is a state one; otherwise it is a point, activity, or culminating activity verb. 

1 When discussing sentences with multiple readings, I often use parenthesised words (e.g. "(habitu- 
ally)") to indicate which reading is being considered. 
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simple present criterion 




point imperfective paradox criterion 



activity culminating activity 

Figure 2.1: Determining the aspectual class of a verb's base form 



For example, in domains where ([O]) and ( |2.2[ ) are possible, "to contain" and "to own" 
are state verbs. 

(2.1) Does any tank contain oil? 

(2.2) Which employees own a car? 

Some clarifications are needed. First, the simple present sometimes refers to something 
that is scheduled to happen. For example, (^3|) could refer to a scheduled assembling 



(in that case, (]2.3|) is very similar to (2.4)). I consider this meaning of Q2.3|) futurate 



Hence, this use of (12.31) does not constitute evidence that "to assemble" is a state verb. 



(2.3) When does J.Adams assemble engine 5? 

(2.4) When will J.Adams assemble engine 5? 

In reporting contexts, the simple present of verbs whose base forms I would not want 
to be classified as states can be used with a non-futurate meaning. For example, in a 
context where the speaker reports events as they happen, ( |2.5[ ) is possible. (This use 
of the simple present is unlikely in Nlitdb questions.) 

(2.5) J.Adams arrives. He moves the container. He fixes the engine. 

The simple present criterion examines questions directed to a Nlitdb, not sentences 
from other contexts. Hence, ( [2.5| ) does not constitute evidence that "to arrive", "to 
move", and "to fix" are state verbs. 

The reader is reminded that when verbs have both habitual and non-habitual meanings, 
I distinguish between habitual and non-habitual homonyms (section |2.3| ). Ignoring 
scheduled-to-happen meanings (that do not count for the simple present criterion), 
(|2.6| ) and ( |2.7| ) can only have habitual meanings. 
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(2.6) Which flight lands on runway 2? 

(2.7) Does any doctor smoke? 
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|2.6| ) asks for a flight that habitually lands on runway 2, and ( |2.7| ) for doctors that 



are smokers. That is, (|2.6j ) and (2/7) can only be understood as involving the habitual 



homonyms of "to land" and "to smoke". (In contrast, ( |2.8[ ) and (2.9) can be understood 



with non-habitual meanings, i.e. as involving the non-habitual homonyms.) 

(2.8) Which flight is landing on runway 2? 

(2.9) Is any doctor smoking? 



Therefore, in domains where ( |2.6|) and ( |2.7D are possible, the habitual "to land" and 
"to smoke" are state verbs. (|2.6|) and (^3) do not constitute evidence that the non- 
habitual "to land" and "to smoke" are state verbs. 

2.4.2 The point criterion 

The second criterion, the point criterion, distinguishes point verbs from activity and 
culminating activity ones (state verbs will have already been separated by the sim- 
ple present criterion; see figure |2.1| ). The point criterion is based on the fact that 
some verbs will be used to describe kinds of world situations that are modelled in the 
database as being always instantaneous. If a verb describes situations of this kind, its 
base form should be classified as point; otherwise, it should be classified as activity or 
culminating activity. 



In section 2.4.5, for example, I consider a hypothetical airport database. That database 
does not distinguish between the times at which a flight starts or stops entering an 
airspace sector. Entering a sector is modelled as instantaneous. Also, in the airport 
domain "to enter" is only used to refer to flights entering sectors. Consequently, in that 
domain "to enter" is a point verb. If "to enter" were also used to refer to, for example, 
groups of passengers entering planes, and if situations of this kind were modelled in the 
database as non-instantaneous, one would have to distinguish between two homonyms 
"to enter", one used with flights entering sectors, and one with passengers entering 
planes. The first would be a point verb; the second would not. 

The person applying the criterion will often have to decide exactly what is or is not 
part of the situations described by the verbs. The database may store, for example, 
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the time-points at which a flight starts to board, finishes boarding, starts to taxi to 
a runway, arrives at the runway, and leaves the ground. Before classifying the non- 
habitual "to depart", one has to decide exactly what is or is not part of departing. Is 
boarding part of departing, i.e. is a flight departing when it is boarding? Is taxiing 
to a runway part of departing? Or does departing include only the time at which the 
flight actually leaves the ground? If a flight starts to depart when it starts to board, 
and finishes departing when it leaves the ground, then the base form of "to depart" 
should not be classified as point, because the database does not treat departures as 
instantaneous (it distinguishes between the beginning of the boarding and the time 
when the flight leaves the ground). If, however, departing starts when the front wheels 
of the aircraft leave the ground and finishes when the rear wheels leave the ground, the 
base form of "to depart" should be classified as point, because the database does not 
distinguish the two times. In any case, the user should be aware of what "to depart" 
is taken to mean. 



The point criterion is similar to claims in fVendlcr 67 1, | Singh fc Singh 92 ], | Vlach 



and elsewhere that achievement (point) verbs denote instantaneous situations. 
2.4.3 The imperfective paradox criterion 

The third criterion distinguishes activity from culminating activity verbs (state and 
point verbs will have already been separated by the point and simple present crite- 
ria). The criterion is based on the imperfective paradox (section 1.2.2[) . Assertions 



containing the past continuous and simple past of the verbs, like ( [2.10 ) - ( 2.13 ), are 
considered. 

(2.10) John was running. 

(2.11) John ran. 

(2.12) John was building a house. 

(2.13) John built a house. 



The reader is reminded that assertions are treated as yes/no questions (section L4). If 
an affirmative answer to the past continuous assertion implies an affirmative answer to 
the simple past assertion (as in ( |2.10| ) - ( 2.11[ )), the verb is an activity one; otherwise 



(e.g. (2.12) - ( 2.13] )), it is a culminating activity one. 
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As will be discussed in section |2.5.3| , the past continuous sometimes has a futurate 
meaning. Under this reading, ( |2.10| ) means "John was going to run.", and an affir- 
mative answer to ( |2. 1Q| ) does not necessarily imply an affirmative answer to ( 2.1 1] ) . 



When applying the imperfective paradox criterion, the past continuous must not have 
its futurate meaning. In various forms, the imperfective paradox criterion has been 



used in []Vendler 67fl , jVlach 93fl , ||Kent 93| , and elsewhere. 



2.4.4 Other criteria 

The three criteria above are not the only ones that could be used. The behaviour 
of verbs when appearing in various forms or when combining with some temporal 
adverbials varies depending on their aspectual classes. Alternative criteria can be 
formulated by observing this behaviour. For example, some authors classify verbs (or 
situations denoted by verbs) by observing how easily they appear in progressive forms 
(to be discussed in section |2.5.3| ), how easily they combine with "for ..." and "in ..." 
duration adverbials (sections 2.9.3| and 2.9.4| below), or what the verbs entail about the 



start or the end of the described situation when they combine with "at ... " temporal 
adverbials (section [2.9.l| below). In some cases, the person classifying the base verb 
forms may be confronted with a verb for which the three criteria of sections 2.4. l| - 12.4.3 



do not yield a clear verdict. In such cases, additional evidence for or against classifying 
a base verb form into a particular class can be found by referring to following sections, 
where the typical behaviour of each class is examined. 

2.4.5 Classifying base verb forms in the airport domain 



To illustrate the use of the criteria of sections [2.4.l| - 12,4.3|, I now consider a hypothetical 



Nlitdb to a temporal database that contains information about the air-traffic of an 



airport. (I borrow some terminology from | Sripada et al. 94 }. The airport domain will 



be used in examples throughout this thesis.) The airport database shows the times 
when flights arrived at, or departed from, the airport, the times flights spent circling 
around the airport while waiting for permission to land, the runways they landed on 
or took off from, the gates where the flights boarded, etc. The database is occasionally 
queried using the Nlitdb to determine the causes of accidents, and to collect data 
that are used to optimise the airport's traffic-handling strategies. 
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The airport's airspace is divided into sectors. Flights approaching or leaving the airport 
cross the boundaries of sectors, each time leaving a sector and entering another one. 
The airport is very busy, and some of its runways may also be closed for maintenance. 
Hence, approaching flights are often instructed to circle around the airport until a 
runway becomes free. When a runway is freed, flights start to land. Landing involves 
following a specific procedure. In some cases, the pilot may abort the landing procedure 
before completing it. Otherwise, the flight lands on a runway, and it then taxies to 
a gate that is free. The moment at which the flight reaches the gate is considered 
the time at which the flight arrived (reaching a location and arriving are modelled as 
instantaneous). Normally (habitually) each flight arrives at the same gate and time 
every day. Due to traffic congestion, however, a flight may sometimes arrive at a gate 
or time other than its normal ones. 

Before taking off, each flight is serviced by a service company. This involves carrying out 
a specific set of tasks. Unless all tasks have been carried out, the service is incomplete. 
Each service company normally (habitually) services particular flights. Sometimes, 
however, a company may be asked to service a flight that it does not normally service. 
After being serviced, a flight may be inspected. Apart from flights, inspectors also 
inspect gates and runways. In all cases, there are particular tasks to be carried out for 
the inspections to be considered complete. 

Shortly before taking off, flights start to board. Unless all the passengers that have 
checked in enter the aircraft, the boarding is not complete, and the flight cannot 
depart. (There are special arrangements for cases where passengers are too late.) 
The flight then leaves the gate, and that moment is considered the time at which 
the flight departed (leaving a location and departing are modelled as instantaneous). 
Normally (habitually) each flight departs from the same gate at the same time every 
day. Sometimes, however, flights depart from gates, or at times, other than their 
normal ones. After leaving its gate, a flight may be told to queue for a particular 
runway, until that runway becomes free. When the runway is free, the flight starts to 
take off, which involves following a specific procedure. As with landings, the pilot may 
abort the taking off procedure before completing it. 

The database also records the states of parts of the airport's emergency system. There 
are, for example, emergency tanks, used by the fire-brigade. Some of those may contain 
water, others may contain foam, and others may be empty for maintenance. 
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state verbs 


activity verbs 


culm, activity verbs 


point verbs 


service (habitually) 


circle 


land 


cross 


arrive (habitually) 


taxi (no destination) 


take off 


enter 


depart (habitually) 


queue 


service (actually) 


become 


contain 




inspect 


start /begin 


be (non-auxiliary) 




board 


stop/finish 






taxi (to destination) 


reach 








leave 








arrive (actually) 








depart (actually) 



Table 2.1: Verbs of the airport domain 



Table 2.1 shows some of the verbs that are used in the airport domain. "To depart", 
"to arrive", and "to service" are used with both habitual and non-habitual meanings. 
(pl4[ ) and (pl^) , for example, can have habitual meanings. In ( [2.15D and fl2.17| ), 
the verbs are probably used with their non-habitual meanings. I distinguish between 
habitual and non-habitual homonyms of "to depart", "to arrive", and "to service" 
(section f2~3| ). 

(2.14) Which nights depart/arrive at 8:00am? 

(2.15) Which flight departed/arrived at 8:00am yesterday? 

(2.16) Which company services BA737? 

(2.17) Which company serviced BA737 yesterday? 



I also distinguish between two homonyms of "to taxi", one that requires a destination- 
denoting complement (as in "BA737 was taxiing to gate 2"), and one that requires no 
such complement (as in "BA737 was taxiing."). 



The simple present criterion and sentences like ( 2.18| ), ( |2.19| ), Q2.14 ), and ( 2.16j ) imply 
that the non-auxiliary "to be", "to contain", and the habitual "to depart", "to arrive", 
and "to service" are state verbs. 



(2.18) Which gates are free? 

(2.19) Does any tank contain foam? 



All other verbs of table |2.l| are not state verbs. For example, (excluding habitual and 
futurate meanings) ( |2.20| ) - ( |2.22| ) sound unlikely or odd in the airport domain. ( 2.23 ) 
- ( 2.25 ) would be used instead. 
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(2.20) ? Which flights circle? 

(2.21) ? Which flight taxies to gate 2? 

(2.22) ? Which flight departs? 

(2.23) Which flights are circling? 

(2.24) Which flight is taxiing to gate 2? 

(2.25) Which flight is departing? 



The verbs in the rightmost column of table 2.1 are used in the airport domain to refer to 



situations which I assume are modelled as instantaneous in the database. Consequently, 
by the point criterion these are all point verbs. In contrast, I assume that the situations 
of the verbs in the two middle columns are not modelled as instantaneous. Therefore, 
those are activity or culminating activity verbs. 



In the airport domain, a sentence like ( 2.27 ) means that BA737 spent some time circling 



around the airport. It does not imply that BA737 completed any circle around the 
airport. Hence, an affirmative answer to ( 2.26 ) implies an affirmative answer to (|2~27| ). 



By the imperfective paradox criterion, "to circle" is an activity verb. 

(2.26) BA737 was circling. 

(2.27) BA737 circled. 

Similar assertions and the imperfective paradox criterion imply that "to taxi" (no 
destination) and "to queue" are also activity verbs. In contrast, the verbs in the third 



column of table 2.1 are culminating activity verbs. For example, in the airport domain 



an affirmative answer to ( 2.28; ) does not imply an affirmative answer to ( |2T2g| ): J.Adams 



may have aborted the inspection before completing all the inspection tasks, in which 
case the inspection is incomplete. 

(2.28) J.Adams was inspecting runway 5. 

(2.29) J.Adams inspected runway 5. 

2.5 Verb tenses 



I now turn to verb tenses. I use "tense" with the meaning it has in traditional English 
grammar textbooks (e.g. [ Thomson fc Martinet 8(1 ). For example, "John sings." and 
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"John is singing." will be said to be in the simple present and present continuous 
tenses respectively. In linguistics, "tense" is not always used in this way. According to 
JComrie"85 ], for example, "John sings." and "John is singing." are in the same tense, 



but differ aspectually. 

Future questions are not examined in this thesis (section |1.4j ). Hence, future tenses 
and futurate meanings of other tenses (e.g. the scheduled-to-happen meaning of the 
simple present; section 2.4.l| ) will be ignored. To simplify further the linguistic data, the 



present perfect continuous and past perfect continuous (e.g. "has/had been inspecting") 
were also not considered: these tenses combine problems from both continuous and 
perfect tenses. This leaves six tenses to be discussed: simple present, simple past, 
present continuous, past continuous, present perfect, and past perfect. 

2.5.1 Simple present 

The framework of this thesis allows the simple present to be used only with state verbs, 
to refer to a situation that holds at the present (e.g. ( 2.3C| ), fl2.3l|) ). 



(2.30) Which runways are closed? 

(2.31) Does any tank contain water? 



Excluding the scheduled-to-happen meaning (which is ignored in this thesis), (|2.32 ) 



can only be understood as asking for the current normal (habitual) servicer of BA737. 
Similarly, ( 2.33j ) can only be asking for the current normal departure gate of BA737. 



( 2.32| ) would not be used to refer to a company that is actually servicing BA737 at the 



present moment (similar comments apply to ( 2,33j )). That is, ( 2.32|) and (|2.33 ) can only 



involve the habitual homonyms of "to service" and "to depart" (which are state verbs), 
not the non-habitual ones (which are culminating activity and point verbs respectively; 



see table 2.1). This is consistent with the assumption that the simple present can only 



be used with state verbs. 

(2.32) Which company services BA737? 

(2.33) Which flights depart from gate 2? 



In the airport domain, "to circle" is an activity verb (there is no state habitual 
homonym). Hence, (|2 . 34 ) is rejected. This is as it should be, because Q2.34 ) can 
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only be understood with a habitual meaning, a meaning which is not available in the 
airport domain (there are no circling habits). 

(2.34) Does BA737 circle? 

The simple present can also be used with non-state verbs to describe events as they 
happen (section |2.4.l| ), or with a historic meaning (e.g. "In 1673 a fire destroys the 
palace."), but these uses are extremely unlikely in Nlitdb questions. 

2.5.2 Simple past 

Like the simple present, the simple past can be used with verbs from all four classes 
(e.g. O - O). 



(2.35) Which tanks contained water on 1/1/95? 

(2.36) Did BA737 circle on 1/1/95? 

(2.37) Which flights (actually) departed from gate 2 on 1/1/95? 

(2.38) Which flights (habitually) departed from gate 2 in 1994? 

(2.39) Which company (actually) serviced BA737 yesterday? 

(2.40) Which company (habitually) serviced BA737 last year? 

(|2.37| ) - (|2.40D show that both the habitual and the non-habitual homonyms of verbs 



like "to depart" or "to service" are generally possible in the simple past. ( 2.41 ) is 
ambiguous. It may refer either to flights that actually departed (perhaps only once) 
from gate 2 in 1994, or to flights that normally (habitually) departed from gate 2 in 
1994. 

(2.41) Which flights departed from gate 2 in 1994? 

The simple past of culminating activity verbs normally implies that the situation of the 
verb reached its climax. For example, in ( 2.42; ) the service must have been completed, 



and in ( 2.43] ) BA737 must have reached gate 2 for the answer to be affirmative. 



(2.42) Did Airserve service BA737? 

(2.43) Did BA737 taxi to gate 2? 

(2.44) BA737 was taxiing to gate 2 but never reached it. 
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Some native English speakers consider simple negative answers to ( p. 42 ) and (2.43) 



unsatisfactory, if for example BA737 was taxiing to gate 2 but never reached it. Al- 
though they agree that strictly speaking the answer should be negative, they consider 
(2.44) a much more appropriate answer. To generate answers like ( [2.44| ), a mechanism 



for cooperative responses is needed, an issue not addressed in this thesis (section |l 



The simple past (and other tenses) often has an anaphoric nature. For example, ( 2.42 ) 



probably does not ask if Airserve serviced BA737 at any time in the past. ( 2.42 ) 



would typically be used with a particular time in mind (e.g. the present day), to ask 



if Airserve serviced BA737 during that time. As will be discussed in section [2.12| , a 
temporal anaphora resolution mechanism is needed to determine the time the speaker 
has in mind. The framework of this thesis currently provides no such mechanism, and 
( 2,42j ) is taken to refer to any past time. (The same approach is adopted with all other 



tenses that refer to past situations.) 

2.5.3 Present continuous and past continuous 

Futurate meanings: The present and past continuous can be used with futurate 
meanings. In that case, ( ^45|) is similar to (|2.46|) . 



(2.45) Who is/was inspecting BA737? 

(2.46) Who will/would inspect BA737? 

Futurate meanings of tenses are not examined in this thesis. Hence, this use of the 
present and past continuous will be ignored. 

Activity and culminating activity verbs: The present and past continuous can 
be used with activity and culminating activity verbs to refer to a situation that is or 
was in progress (e.g. fl2.47[ ) - ( 2.48 ) from the airport domain). 



(2.47) Are/Were any flights circling? 

(2.48) Is/Was BA737 taxiing to gate 2? 



In the case of culminating activity verbs, there is no requirement for the climax to 
be reached at any time. The past continuous version of ( |2.48j ), for example, does not 
require BA737 to have reached gate 2 (cf. (fc43|)). 
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Point verbs: The present and past continuous of point verbs often refers to a 
preparatory process that is or was ongoing, and that normally leads to the instan- 
taneous situation of the verb. For example, in the airport domain where "to depart" is 
a point verb and departing includes only the moment where the flight leaves its gate, 



one could utter ( p,49|) when the checking-in is ongoing or when the flight is boarding. 
(2.49) BA737 is departing. 

The framework of this thesis provides no mechanism for determining exactly which 
preparatory process is asserted to be in progress. (Should the checking-in be in progress 
for the response to ( |2.49| ) to be affirmative? Should the boarding be ongoing?) Hence, 
this use of the present and past continuous of point verbs is not allowed. The response 
to ( 2.4S| ) will be affirmative only at the time-point where BA737 is leaving its gate (as 



will be discussed in section 5.2.2, the database may model time-points as corresponding 
to minutes or even whole days). To avoid misunderstandings, the Nlitdb should warn 
the user that ( p. 49 ) is taken to refer to the actual (instantaneous) departure, not to 



any preparatory process. This is again a case for cooperative responses, an issue not 
examined in this thesis. 



State verbs: It has often been observed (e.g. Vendler's tests in section |2.2|) that 
state verbs are not normally used in progressive forms. For example, ( 2.50| ) and (2.52) 
are easily rejected by most native speakers. (I assume that "to own" and "to consist" 
would be classified as state verbs, on the basis that simple present questions like (2.51) 



and ( 2.53 ) are possible.) 

*Who is owning five cars? 

Who owns five cars? 
* Which engine is consisting of 34 parts? 



(2.50) 
(2.51) 
(2.52) 
(2.53) 



Which engine consists of 34 parts? 



? 



The claim that state verbs do not appear in the progressive is challenged by sentences 
like ( |2.54 ) (from [ |5mith 86| ], cited in [Passonneau 88]; [Kent 93] and [Vlach 93] provide 
similar examples). ( |2.54| ) shows that the non-auxiliary "to be", which is typically 
classified as state verb, can be used in the progressive. 



(2.54) My daughter is being very naughty. 
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Also, some native English speakers find Q2.55D and ( p. 56 ) acceptable (though they 



would use the non-progressive forms instead). (I assume that "to border" would be 
classified as state verb, on the basis that "Which countries border France?" is possible.) 

(2.55) ?Tank 4 was containing water when the bomb exploded. 

(2.56) ?Which countries were bordering France in 1937? 



Not allowing progressive forms of state verbs also seems problematic in questions like 
(|2.57|) . (|2.57|) has a reading which is very similar to the habitual reading of ( 2.58 ) 
(habitually serviced BA737 in 1994). 



(2.57) Which company was servicing BA737 in 1994? 

(2.58) Which company serviced (habitually) BA737 in 1994? 

The reader is reminded that in the airport domain I distinguish between a habitual and 
a non-habitual homonym of "to service". The habitual homonym is a state verb, while 
the non-habitual one is a culminating activity verb. If progressive forms of state verbs 
are not allowed, then only the non-habitual homonym (actually servicing) is possible 
in ( |2.57| ). This does not account for the apparently habitual meaning of (jT57D . 



One could argue that the reading of ( 2.57 ) is not really habitual but iterative (servicing 



many times, as opposed to having a servicing habit). As pointed out in [pomric 76 ] 



(p. 27), the mere repetition of a situation does not suffice for the situation to be 
considered a habit. ( 2.59j ), for example, can be used when John is banging his hand 



on the table repeatedly. In this case, it seems odd to claim that ( [2.59| ) asserts that 
John has the habit of banging his hand on the table, i.e. ( |2.59D does not seem to be 
equivalent to the habitual ( 2.60| ). 



(2.59) John is banging his hand on the table. 

(2.60) John (habitually) bangs his hand on the table. 



In sentences like ( |2.57| ) - ( |2.58 ), however, the difference between habitual and iterative 



meaning is hard to define. For simplicity, I do not distinguish between habitual and 
iterative readings, and I allow state verbs to be used in progressive forms (with the same 
meanings as the non-progressive forms). This causes ( 2.57j ) to receive two readings: 



one involving the habitual "to service" (servicing habitually in 1994), and one involving 
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the non-habitual "to service" (actually servicing at some time in 1994; this reading is 
more likely without the "in 1994"). ( 2.50| ) and ( 2.52j ) are treated as equivalent to (2.51) 



and ( |2l33D 



As will be discussed in section 2.9, I assume that progressive tenses cause an aspec- 
tual shift from activities and culminating activities to states. In the airport domain, 
for example, although the base form of "to inspect" is a culminating activity, "was 
inspecting" is a state. 

2.5.4 Present perfect 

Like the simple past, the present perfect can be used with verbs of all four classes 
to refer to past situations (e.g. ( 2.61| ) - ( |2.65[ )). With culminating activity verbs, 



the situation must have reached its climax (e.g. in ( |2.65| ) the service must have been 
completed) . 

(2.61) Has BA737 (ever) been at gate 2? 

(2.62) Which flights have circled today? 

(2.63) Has BA737 reached gate 2? 

(2.64) Which company has (habitually) serviced BA737 this year? 

(2.65) Has Airserve (actually) serviced BA737? 



It has often been claimed (e.g. [Moens 87], |Vlach 93], | Blackburn et al. 94fl ) that the 
English present perfect asserts that some consequence of the past situation holds at 
the present. For example, ( 2.66j ) seems to imply that there is a consequence of the fact 



that engine 5 caught fire that still holds (e.g. engine 5 is still on fire, or it was damaged 
by the fire and has not been repaired). In contrast, fl2.67j ) does not seem to imply (at 
least not as strongly) that some consequence still holds. 

(2.66) Engine 5 has caught fire. 

(2.67) Engine 5 caught fire. 



Although these claims are intuitively appealing, it is difficult to see how they could 
be used in a Nlitdb. Perhaps in ( |2.68| ) the Nlitdb should check not only that the 
landing was completed, but also that some consequence of the landing still holds. 
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(2.68) Has BA737 landed? 



It is unclear, however, what this consequence should be. Should the Nlitdb check that 
the plane is still at the airport? And should the answer be negative if the plane has 
departed since the landing? Should the Nlitdb check that the passengers of BA737 
are still at the airport? And should the answer be negative if the passengers have left 
the airport? Given this uncertainty, the framework of this thesis does not require the 
past situation to have present consequences. 

When the present perfect combines with "for ..." duration adverbials (to be discussed 



in section |2.9.3 ), there is often an implication that the past situation still holds at 
the present (this seems related to claims that the past situation must have present 
consequences). For example, there is a reading of ( 2,6S| ) where J.Adams is still a 



manager. (( |2.69| ) can also mean that J.Adams was simply a manager for two years, 



without the two years ending at the present moment.) In contrast, ( 2.70 ) carries no 
implication that J.Adams is still a manager (in fact, it seems to imply that he is no 
longer a manager). 

(2.69) J.Adams has been a manager for two years. 

(2.70) J.Adams was a manager for two years. 

Representing in Top the still-holding reading of sentences like ( |2.69| ) has proven diffi- 
cult. Hence, I ignore the possible implication that the past situation still holds, and I 
treat (|2.69| ) as equivalent to fl2.7CQ . 

The present perfect does not combine felicitously with some temporal adverbials. For 
example, ( p. 71 ) and ( |2.74| ) sound at least odd to most native English speakers (they 



would use ( |2.72| ) and ( |2.T5| ) instead). In contrast, fl2.73| ) and fl2.76| ) are acceptable. 

(2.71) ? Which flights have landed yesterday? 

(2.72) Which flights landed yesterday? 

(2.73) Which flights have landed today? 

(2.74) ? Which flights has J.Adams inspected last week? 

(2.75) Which flights did J.Adams inspect last week? 

(2.76) Which flights has J.Adams inspected this week? 
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( 2.71] ) - ( |2.76| ) suggest that the present perfect can only be used if the time of the 



adverbial contains not only the time where the past situation occurred, but also the 
speech time, the time when the sentence was uttered. (A similar explanation is given on 
p. 167 of [ Thomson &: Martinet 86] ].) Q2.73 ) is felicitous, because "today" contains the 



speech time. In contrast, ( 2.71 ) is unacceptable, because "yesterday" cannot contain 
the speech time. The hypothesis, however, that the time of the adverbial must include 
the speech time does not account for the fact that ( |2.77| ) is acceptable by most native 
English speakers (especially if the "ever" is added), even if the question is not uttered 
on a Sunday. 

(2.77) Has J.Adams (ever) inspected BA737 on a Sunday? 



As pointed out in [Moens fc Steedman 88], a superstitious person could also utter 



(|2.78j ) on a day other than Friday the 13th. 
(2.78) They have married on Friday 13th! 

One could attempt to formulate more elaborate restrictions, to predict exactly when 
temporal adverbials can be used with the present perfect. In the case of a Nlitdb, 
however, it is difficult to see why this would be worth the effort, as opposed to simply 



accepting questions like ( 2.71 ) as equivalent to ( 2.72 ). I adopt the latter approach. 



Given that the framework of this thesis does not associate present consequences with 
the present perfect, that the still-holding reading of sentences like ( |2.69| ) is not sup- 
ported, and that questions like Q2.71 ) are allowed, there is not much left to distinguish 



the present perfect from the simple past. Hence, I treat the present perfect as equiva- 
lent to the simple past. 

2.5.5 Past perfect 

The past perfect is often used to refer to a situation that occurred at some past time 
before some other past time. Following Reichenbach |Reichenbach 47 1 and many others, 



let us call the latter time the reference time. ( |2.79| ) and ( |2.8C| ) have readings where 
"at 5:00pm" specifies the reference time. In that case, ( |2.79| ) asks for flights that 
Airserve serviced before 5:00pm, and ( [2.80 ) asks if BA737 was at gate 2 some time 



before 5:00pm. (When expressing these meanings, "by 5:00pm" is probably preferable. 
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I claim, however, that "at 5:00pm" can also be used in this way. "By ... " adverbials 
are not examined in this thesis.) 

(2.79) Which flights had Airserve serviced at 5:00pm? 

(2.80) Had BA737 been at gate 2 at 5:00pm? 

With culminating activity verbs, the climax must have been reached before (or possibly 



at) the reference time. For example, in ( 2.79 ) the services must have been completed 



up to 5:00pm. Perhaps some consequence of the past situation must still hold at the 



reference time (see similar comments about the present perfect in section 2.5.4). As 
with the present perfect, however, I ignore such consequential links. 

When the past perfect combines with temporal adverbials, it is often unclear if the 
adverbial is intended to specify the reference time or directly the time of the past 



situation. For example, (2.81) could mean that BA737 had already reached gate 2 



at 5:00pm, or that it reached it at 5:00pm. In the latter case, Q2.81 ) is similar to 



the simple past ( |2.82j ), except that ( |2.81| ) creates the impression of a longer distance 
between the time of the reaching and the speech time. 

(2.81) BA737 had reached gate 2 at 5:00pm. 

(2.82) BA737 reached gate 2 at 5:00pm. 

When the past perfect combines with "for ... " duration adverbials and a reference 
time is specified, there is often an implication that the past situation still held at the 
reference time. (A similar implication arises in the case of the present perfect; see 
section For example, ( ^83| ) seems to imply that J.Adams was still a manager 



on 1/1/94. As in the case of the present perfect, I ignore this implication, for reasons 
related to the difficulty of representing it in Top. 

(2.83) J.Adams had been a manager for two years on 1/1/94. 



As will be discussed in section |2Jj , I assume that the past perfect triggers an aspectual 
shift to state (e.g. the base form of "to inspect" is a culminating activity, but "had 
inspected" is a state). This shift seems to be a property of all perfect tenses. For 
reasons, however, related to the fact that I treat the present perfect as equivalent to 
the simple past (section |2.5.4 ), I do not postulate any shift in the case of the present 
perfect. 



CHAPTER 2. THE LINGUISTIC DATA AND AN INFORMAL ACCOUNT 37 
2.6 Special temporal verbs 



Through their tenses, all verbs can convey temporal information. Some verbs, however, 
like "to begin" or "to precede", are of a more inherently temporal nature. These verbs 
differ from ordinary ones (e.g. "to build", "to contain") in that they do not describe 
directly situations, but rather refer to situations introduced by other verbs or nouns. 
(A similar observation is made in [ Passonneau 88| .) In ( [2.84 ), for example, "to begin" 



refers to the situation of "to build". "To start", "to end", "to finish", "to follow", "to 
continue", and "to happen" all belong to this category of special temporal verbs. 

(2.84) They began to build terminal 9 in 1985. 

From all the special temporal verbs, I have considered only "to start", "to begin", "to 
stop", and "to finish". I allow "to start", "to begin", "to end", and "to finish" to be 
used with state and activity verbs, even though with state verbs "to begin" and "to 
finish" usually sound unnatural (e.g. fl2.85|) ), and with activity verbs (e.g. ( |2.86| )) it 
could be argued that the use of "to begin" or "to finish" signals that the speaker has 
in mind a culminating activity (not activity) view of the situation. 

(2.85) ?Which tank began to contain/finished containing water on 27/7/95? 

(2.86) Which flight began/finished circling at 5:00pm? 

When combining with culminating activity verbs, "to start" and "to begin" have the 
same meanings. "To stop" and "to finish", however, differ: "to finish" requires the 
climax to be reached, while "to stop" requires the action or change to simply stop 
(possibly without being completed). For example, in ( |2.87 ) the service must have 



simply stopped, while in (|2.88[) it must have been completed. 

(2.87) Which company stopped servicing (actually) BA737 at 5:00pm? 

(2.88) Which company finished servicing (actually) BA737 at 5:00pm? 

With point verbs (like "to enter" and "to leave" in the airport domain), the use of "to 
start", "to begin", "to stop", and "to finish" (e.g. (HH), (]2T9C| )) typically signals that 



the person submitting the question is unaware that the situation of the point verb is 



taken to be instantaneous. In these cases, I ignore the temporal verbs (e.g. (|2.89|) is 
treated as ( |2.91| )). Ideally, the Nlitdb would also warn the user that the temporal 
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verb is ignored, and that the situation is modelled as instantaneous (another case for 
cooperative responses; see section |L4| ) . The framework of this thesis, however, provides 
no mechanism for generating such warnings. 

(2.89) Which flight started to enter sector 2 at 5:00pm? 

(2.90) Which flight finished leaving gate 2 at 5:00pm? 

(2.91) Which flight entered sector 2 at 5:00pm? 

2.7 Temporal nouns 

Some nouns have a special temporal nature. Nouns like "development" or "inspection" , 
for example, are similar to verbs, in that they introduce world situations that occur 
in time. The role of "development" in ( |2.92| ) is very similar to that of "to develop" in 
(PI )- 

(2.92) When did the development of Masque start? 

(2.93) When did they start to develop Masque? 

Other nouns indicate temporal order (e.g. "predecessor", "successor"), or refer to start 
or end-points (e.g. "beginning" , "end"). Finally, many nouns (and proper names) refer 
to time periods, points, or generally entities of the temporal ontology (e.g. "minute", 
"July", "event"). 

From all the temporal nouns (and proper names), this thesis examines only nouns 
like "year", "month", "week", "day", "minute", "second", "1993", "July", "1/1/85", 
"Monday", "3:05pm". Temporal nouns of a more abstract nature (e.g. "period", 
"point", "interval", "event", "time", "duration"), nouns referring to start or end- 
points, nouns introducing situations, and nouns of temporal precedence are not con- 
sidered. 

2.8 Temporal adjectives 

There are also adjectives of a special temporal nature. Some refer to a temporal 
order (e.g. "current", "previous", "earliest"), others refer to durations (e.g. "brief", 
"longer"), and others specify frequencies (e.g. "annual", "daily"). Adjectives of this 
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kind are not examined in this thesis, with the exception of "current" which is supported 
to illustrate some points related to temporal anaphora (these points will be discussed 
in section 2.11| ), ("Normal" adjectives, like "open" and "free", are also supported.) 



2.9 Temporal adverb ials 

This section discusses adverbials that convey temporal information. 
2.9.1 Punctual adverbials 



Some adverbials are understood as specifying time-points. Following [ Vlach 93fl , I call 



these punctual adverbials. In English, punctual adverbials are usually prepositional 
phrases introduced by "at" (e.g. "at 5:00pm", "at the end of the inspection"). In this 
thesis, only punctual adverbials consisting of "at" and clock-time expressions (e.g. "at 
5:00pm") are considered. 

With states: When combining with state expressions, punctual adverbials specify a 
time where the situation of the state expression holds. There is usually no implication 



that the situation of the state starts or stops at the time of the adverbial. (2.94), 
for example, asks if tank 5 was empty at 5:00pm. There is no requirement that the 
tank must have started or stopped being empty at 5:00pm. Similar comments apply 
to ( p5l) . 



(2.94) Was tank 5 empty at 5:00pm? 

(2.95) Which flight was at gate 2 at 5:00pm? 

In other words, with states punctual adverbials normally have an interjacent mean- 
ing, not an inchoative or terminal one. ("Interjacent", "inchoative", and "terminal" 
are borrowed from [Kent 93 1. Kent explores the behaviour of "at", "for", and "in" 



adverbials, and arrives at conclusions similar to the ones presented here.) 

In narrative contexts, punctual adverbials combining with states sometimes have in- 
choative meanings. For example, the "at 8:10am" in ( |2.96| ) most probably specifies the 
time when J.Adams arrived (started being) in Glasgow. In Nlitdb questions, how- 



ever, this inchoative meaning seems unlikely. For example, it seems unlikely that ( 2.97 ) 
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would be used to enquire about persons that arrived in Glasgow at 8:10am. Hence, 
for the purposes of this thesis, it seems reasonable to assume that punctual adverbials 
combining with states always have interjacent meanings. 

(2.96) J.Adams left Edinburgh early in the morning, and at 8:10am he was in Glas- 
gow. 

(2.97) Who was in Glasgow at 8:10am? 



With points: With point expressions, punctual adverbials specify the time where 
the instantaneous situation of the point expression takes place (e.g. ( 2.98) ) , ( |2.99 ); "to 
enter" and "to reach" are point verbs in the airport domain). 

(2.98) Which flight entered sector 2 at 23:02? 

(2.99) Which flight reached gate 5 at 23:02? 



(|2.100 ) is ambiguous. It may either involve the non-habitual homonym of "to depart" 
(this homonym is a point verb in the airport domain), in which case 5:00pm is the time 
of the actual departure, or the state habitual homonym (to depart habitually at some 
time), in which case 5:00pm is the habitual departure time. In the latter case, I treat 
"at 5:00pm" as a prepositional phrase complement of the habitual "to depart", not as 
a temporal adverbial. This reflects the fact that the "at 5:00pm" does not specify a 
time when the habit holds, but it forms part of the description of the habit, i.e. it is 



used in a way very similar to how "from gate 2" is used in (2.101) 



(2.100) Which flight departed at 5:00pm? 

(2.101) Which flight departed (habitually) from gate 2 (last year)? 



With activities: With activities, punctual adverbials usually have an inchoative 
meaning, but an interjacent one is also possible in some cases. ( |2~1Q2| ), for example, 
could refer to a flight that either joined the queue of runway 2 at 5:00pm or was 
simply in the queue at 5:00pm. (In the airport domain, "to queue" and "to taxi" (no 
destination) are activity verbs.) The inchoative meaning seems the preferred one in 
( |2.102[ ). It also seems the preferred one in (2.103), though an interjacent meaning is 
(arguably) also possible. The interjacent meaning is easier to accept in ( 2.104| ). 



(2.102) Which flight queued for runway 2 at 5:00pm? 
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(2.103) BA737 taxied at 5:00pm. 

(2.104) Which flights circled at 5:00pm? 



41 



With past continuous forms of activity verbs, however, punctual adverbials normally 
have only interjacent meanings (compare (|2~10^ ) and (|2l03D to ( |2~T05|) and ( El06| )). 
(One would not normally use punctual adverbials with present continuous forms, since 
in that case the situation is known to take place at the present.) 

(2.105) Which flight was queueing for runway 2 at 5:00pm? 

(2.106) BA737 was taxiing at 5:00pm. 



To account for sentences like ( p. 105] ) and fl2.106| ) (and other phenomena to be discussed 
in following sections), I classify the progressive tenses (present and past continuous) of 
activity and culminating activity verbs as states. For example, in the airport domain, 
the base form of "to queue" is an activity. Normally, all other forms of the verb (e.g. the 
simple past) inherit the aspectual class of the base form. The progressive tenses (e.g. "is 
queueing", "was queueing") of the verb, however, are states. (The progressive can be 
seen as forcing an aspectual shift from activities or culminating activities to states. No 
such aspectual shift is needed in the case of point verbs.) This arrangement, along with 
the assumption that punctual adverbials combining with states have only interjacent 



meanings, accounts for the fact that (2.105) and (2.106) have only interjacent meanings. 
In various forms, assumptions that progressive tenses cause aspectual shifts to states 
have also been used in [ Dowty 86fl , [Moens 87], [Vlach 93], ] |Kent 93| ], and elsewhere. 



With culminating activities: When combining with culminating activities, punc- 
tual adverbials usually have inchoative or terminal meanings (when they have terminal 
meanings, they specify the time when the climax was reached). The terminal reading 
is the preferred one in ( p. 107 ). In ( 2.108] ) both readings seem possible. In ( p7lQ9|) the 
inchoative meaning seems the preferred one. (In the airport domain, "to land", "to 
taxi" (to destination), and "to inspect" are culminating activity verbs.) 

(2.107) Which flight landed at 5:00pm? 

(2.108) Which flight taxied to gate 4 at 5:00pm? 

(2.109) Who inspected BA737 at 5:00pm? 
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Perhaps, as with activities, an interjacent meaning is sometimes also possible (e.g. 
(2.108) would refer to a flight that was on its way to gate 4 at 5:00pm). This may 



be true, but with culminating activities the inchoative or terminal reading is usually 
much more dominant. For simplicity, I ignore the possible interjacent meaning in the 
case of culminating activities. 

With past continuous forms of culminating activity verbs, punctual adverbials normally 
have only interjacent meanings. Compare, for example, Q2.107D - fl2.10Sf) to (j2.11CQ 



( 2.112[) . This is in accordance with the assumption that the progressive tenses of 



activity and culminating activity verbs are states. 

(2.110) Which flight was landing at 5:00pm? 

(2.111) Which flight was taxiing to gate 4 at 5:00pm? 

(2.112) Who was inspecting BA737 at 5:00pm? 



With past perfect: As discussed in section 2.5.5| , in sentences like ( |2.113| ) the ad- 



verbial can be taken to refer either directly to the taxiing (the taxiing started or ended 
at 5:00pm) or to the reference time (the taxiing had already finished at 5:00pm). 

(2.113) BA737 had taxied to gate 2 at 5:00pm. 

(2.114) BA737 had [taxied to gate 2 at 5:00pm]. 

(2.115) BA737 [had taxied to gate 2] at 5:00pm. 



The way in which sentences like (2.113) are treated in this thesis will become clearer 



in chapter ||. A rough description, however, can be given here. ( 2.113| ) is treated 



as syntactically ambiguous between ( |2,114| ) (where the adverbial applies to the past 
participle "taxied") and ( 2.115| ) (where the adverbial applies to the past perfect "had 



taxied"). The past participle ("taxied") inherits the aspectual class of the base form, 
and refers directly to the situation of the verb (the taxiing). In contrast, the past perfect 
( "had taxied") is always classified as state (the past perfect can be seen as causing an 
aspectual shift to state), and refers to a time-period that starts immediately after the 
end of the situation of the past participle (the end of the taxiing), and extends up to 
the end of time. Let us call this period the consequent period. 

In the airport domain, the base form of "to taxi" (to destination) is a culminating 
activity. Hence, the past participle "taxied" (which refers directly to the taxiing) 
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is also a culminating activity. In ( 2.114| ), a punctual adverbial combines with the 
(culminating activity) past participle. According to the discussion above, two readings 
arise: an inchoative (the taxiing started at 5:00pm) and a terminal one (the taxiing 
finished at 5:00pm). In contrast, in ( 2.115| ) the punctual adverbial combines with the 
past perfect "had taxied", which is a state expression that refers to the consequent 
period. Hence, only an interjacent reading arises: the time of the adverbial must 
simply be within the consequent period (there is no need for the adverbial's time to 
be the beginning or end of the consequent period). This requires the taxiing to have 
been completed at the time of the adverbial. A similar arrangement is used when 
the past perfect combines with period adverbials, duration "for ... " and "in ... " 
adverbials, or temporal subordinate clauses (to be discussed in following sections). The 
assumption that the past perfect causes an aspectual shift to state is similar to claims 
in [Moens_87], | Vlach 93[ , and elsewhere, that English perfect forms are (or refer to) 
states. 



Lexical, consequent, and progressive states: There is sometimes a need to dis- 
tinguish between expressions that are states because they have inherited the state 
aspectual class of a base verb form, and expressions that are states because of an as- 



pectual shift introduced by a past perfect or a progressive tense. Following Moens 87 ], 
I use the terms lexical state, consequent state, and progressive state to distinguish the 
three genres. In the airport domain, for example, the base form of "to queue" is a 
lexical state. The simple past "queued" and the past participle "queued" are also 
lexical states. The past perfect "had queued" is a consequent state, while the present 
continuous form "is queueing" is a progressive state. 



Finally, for reasons that will be discussed in section 4.1(: , I assume that punctual 
adverbials cause the aspectual class of the syntactic constituent they modify to become 



point. In ( 2.115 ), for example, the "taxied to gate 2" inherits the culminating activity 
aspectual class of the base form. The past perfect causes the aspectual class of "had 
taxied to gate 2" to become consequent state. Finally, the "at 5:00pm" causes the 



aspectual class of "had departed at 5:00pm" to become point. Table 2.2 summarises 
the main points of this section. 
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meanings of punctual adverbials 


with state 


interjacent 


with activity 


inchoative or interjacent 


with culm, activity 


inchoative or terminal 


with point 


specifies time of instantaneous situation 


The resulting aspectual class is point. 



Table 2.2: Punctual adverbials in the framework of this thesis 
2.9.2 Period adverbials 



Unlike punctual adverbials, which are understood as specifying points in time, ad- 
verbials like "in 1991", "on Monday", "yesterday" (e.g. ( pTT6| ) - (fni|)) are usually 



understood as specifying longer, non-instantaneous periods of time. In ( p. 116 ), for 



example, the period of "in 1991" covers the whole 1991. I call adverbials of this kind 
period adverbials. 

(2.116) Who was the sales manager in 1991? 

(2.117) Did BA737 circle on Monday? 

(2.118) Which nights did J.Adams inspect yesterday? 



"Before ..." and "after ..." adverbials (e.g. (|2T19D ) can also be considered period ad- 
verbials, except that in this case one of the boundaries of the period is left unspecified. 



(I model time as bounded; see section 3.3 below. In the absence of other constraints, 
I treat the unspecified boundary as the beginning or end of time.) In ( p.H9| ), for ex- 
ample, the end of the period is the beginning of 2/5/95; the beginning of the period is 
left unspecified. "Before" and "after" can also introduce temporal subordinate clauses; 



this will be discussed in section 2.10.2. 



(2.119) Which company serviced BA737 before 2/5/95? 



This thesis examines only period adverbials introduced by "in", "on", "before", and 
"after", as well as "today" and "yesterday". ( "In ... " adverbials can also specify 
durations, e.g. "in two hours"; this will be discussed in section 2.9.4| .) Other period 
adverbials, like "from 1989 to 1990", "since 1990", "last week", or "two days ago", 
are not considered. Extending the framework of this thesis to support more period 
adverbials should not be difficult. 
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With states: When period adverbials combine with state expressions, the situation 
of the state expression must hold for at least some time during the period of the 
adverbial. In ( 2.120| ), for example, the person must have been a manager for at least 



some time in 1995. Similarly, in (2.121), the person must have been at gate 2 for at 



least some time on the previous day. 

(2.120) Who was a manager in 1995? 

(2.121) Who was at gate 2 yesterday? 

There is often, however, an implication that the situation holds throughout the period 



of the adverbial. ( 2.122 ), for example, could mean that the tank was empty throughout 



January, not at simply some part of January. Similarly, in (2.123) the user could be 



referring to tanks that were empty throughout January. In that case, if a tank was 
empty only some days in January and the Nlitdb included that tank in the answer, 
the user would be misled to believe that the tank was empty throughout January. 
Similar comments can be made for Q2.124) and ( |2.125| ). 

(2.122) Tank 4 was empty in January. 

(2.123) Which tanks were empty in January? 

(2.124) Was runway 2 open on 6/7/95? 

(2.125) Which flights departed (habitually) from gate 2 in 1993? 

The same implication is possible in sentences with "before ... "or "after ... " adver- 
bials. ( 2.126j ), for example, could mean that the runway was open all the time from 



some unspecified time up to immediately before 5:00pm (and possibly longer). 
(2.126) Runway 2 was open before 5:00pm. 

One way to deal with such implications is to treat sentences where period adverbials 
combine with states as ambiguous. That is, to distinguish between a reading where the 
situation holds throughout the adverbial's period, and a reading where the situation 
holds at simply some part of the adverbial's period. | Vlach 93fl (p. 256) uses the terms 



durative and inclusive to refer to the two readings. (A NLITDB could paraphrase 
both readings and ask the user to select one, or it could provide answers to both 
readings, indicating which answer corresponds to which reading.) This approach has 
the disadvantage of always generating two readings, even in cases where the durative 



CHAPTER 2. THE LINGUISTIC DATA AND AN INFORMAL ACCOUNT 46 

reading is clearly impossible. For example, when the state expression combines not 
only with a period adverbial but also with a "for ..." duration adverbial, the meaning 
can never be that the situation must necessarily hold all the time of the adverbial's 
period. For example, ( |2.127 ) can never mean that the tank must have been empty 



throughout January (cf. ( 2.123|) ) 



(2.127) Which tank was empty for two days in January? 

(2.128) When on 6/7/95 was tank 5 empty? 



Similarly, in time-asking questions like (|2.128|) , the durative reading is impossible. 



( 2.128 ) can never mean that the tank must have been empty throughout 6/7/95 (cf. 
( 2.124 )). Formulating an account of exactly when the durative reading is possible is a 
task which I have not undertaken. Although in chapter [3| I discuss how the distinction 
between durative and inclusive readings could be captured in Top, for simplicity in 
the rest of this thesis I consider only the inclusive readings, ignoring the durative ones. 



With points: When period adverbials combine with point expressions, the period 
of the adverbial must contain the time where the instantaneous situation of the point 
expression occurs (e.g. ( |2.129| )). 

(2.129) Did BA737 enter sector 5 on Monday? 



With culminating activities: When period adverbials combine with culminating 
activity expressions, I allow two possible readings: (a) that the situation of the culmi- 
nating activity expression both starts and reaches its completion within the adverbial's 
period, or (b) that the situation simply reaches its completion within the adverbial's 
period. In the second reading, I treat the culminating activity expression as referring 
to only the completion of the situation it would normally describe, and the aspectual 
class is changed to point. 

The first reading is the preferred one in (fEl3Cl ) which is most naturally understood as 
referring to a runner who both started and finished running the 40 miles on Wednesday 
("to run" (a distance) is typically classified as culminating activity verb). 



(2.130) Who ran 40 miles on Wednesday? 
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In the airport domain, the first reading is the preferred one in ( p. 131] ) (the inspection 
both started and was completed on Monday). 

(2.131) J.Adams inspected BA737 on Monday. 

The second reading (the situation simply reaches its completion within the adverbial's 
period) is needed in questions like (2.132) and ( 2.133| ). In the airport domain, "to 



land" and "to take off" are culminating activity verbs (landings and taking offs involve 
following particular procedures; the landing or taking off starts when the pilot starts 
the corresponding procedure, and is completed when that procedure is completed). If 
only the first reading were available (both start and completion within the adverbial's 
period), in Q2.132 ) the Nlitdb would report only flights that both started and finished 



landing on Monday. If a flight started the landing procedure at 23:55 on Sunday and 
finished it at 00:05 on Monday, that flight would not be reported. This seems over- 



restrictive. In (2.132) the most natural reading is that the flights must have simply 
touched down on Monday, i.e. the landing must have simply been completed within 
Monday. Similar comments can be made for (2.133) and ( 2. 134] ) (in domains where 
"to fix" is a culminating activity verb). 

(2.132) Which flights landed on Monday? 

(2.133) Which flights took off after 5:00pm? 

(2.134) Did J.Adams fix any faults yesterday? 

The problem in these cases is that "to land", "to take off", and "to fix" need to 
be treated as point verbs (referring to only the time-points where the corresponding 
situations are completed), even though they have been classified as culminating activity 
verbs (section |2.4.5| ). The second reading allows exactly this. The culminating activity 
expression is taken to refer to only the completion point of the situation it would 
normally describe, its aspectual class is changed to point, and the completion point is 
required to fall within the adverbial's period. 

The fact that two readings are allowed when period adverbials combine with culmi- 
nating activities means that sentences like ( 2.131 ) - ( |2. 134 ) are treated as ambiguous. 



In all ambiguous sentences, I assume that a Nlitdb would present all readings to 
the user asking them to choose one, or that it would provide answers to all readings, 
showing which answer corresponds to which reading. (The prototype Nlitdb of this 
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thesis adopts the second strategy, though the mechanism for explaining which answer 
corresponds to which reading is primitive: the readings are shown as Top formulae.) 

In the case of "before ..." adverbials (e.g. ( |2.135| )), the two readings are semantically 
equivalent: requiring the situation to simply reach its completion before some time is 
equivalent to requiring the situation to both start and reach its completion before that 
time. To avoid generating two equivalent readings, I allow only the reading where the 
situation both starts and reaches its completion within the adverbial's period. 

(2.135) Which flights took off before 5:00pm? 

Even with the second reading, the answers of the Nlitdb may not always be satis- 
factory. Let us assume, for example, that J.Adams started inspecting a flight late on 
Monday, and finished the inspection early on Tuesday. None of the two readings would 
include that flight in the answer to ( |2.136| ), because both require the completion point 
to fall on Monday. While strictly speaking this seems correct, it would be better if the 
Nlitdb could also include in the answer inspections that partially overlap the adver- 
bial's period, warning the user about the fact that these inspections are not completely 
contained in the adverbial's period. This is another case where cooperative responses 
(section |L~4| ) are needed. 

(2.136) Which flights did J.Adams inspect on Monday? 

Finally, I note that although in the airport domain "to taxi" (to destination) is a 
culminating activity verb, in ( |2.137 ) the verb form is a (progressive) state. Hence, the 



Nlitdb 's answer would be affirmative if BA737 was taxiing to gate 2 some time within 
the adverbial's period (before 5:00pm), even if BA737 did not reach the gate during 
that period. This captures correctly the most natural reading of (|2.137| ). 

(2.137) Was BA737 taxiing to gate 2 before 5:00pm? 

With activities: When period adverbials combine with activities, I require the sit- 
uation of the verb to hold for at least some time within the adverbial's period (same 



meaning as with states). In ( 2.138 ), for example, the flight must have circled for at 



least some time on Monday, and in ( 2.139 ) the flights must have taxied for at least 
some time after 5:00pm. 
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meanings of period adverbials 


with state or activity 


situation holds for at least part of adverbial's period 


with culm, activity 


situation starts and is completed within adverbial's period, 
or situation is simply completed within adverbial's period*^ 


with point 


instantaneous situation occurs within adverbial's period 


*Not with "before ... " adverbials. 

TThe resulting aspectual class is point. (In all other cases the aspectual class 
remains the same.) 



Table 2.3: Period adverbials in the framework of this thesis 

(2.138) Did BA737 circle on Monday? 

(2.139) Which flights taxied after 5:00pm? 



Another stricter reading is sometimes possible (especially with "before" and "after"): 
that the situation does not extend past the boundaries of the adverbial's period. For 
example, ( 2.139D would refer to flights that started to taxi after 5:00pm (a flight that 
started to taxi at 4:55pm and continued to taxi until 5:05pm would not be reported). 



This reading is perhaps also possible with states (e.g. ( 2.126 )), though with activities 
it seems easier to accept. As a simplification, such readings are ignored in this thesis. 



Elliptical forms: "Before" and "after" are sometimes followed by noun phrases that 
do not denote entities of the temporal ontology (e.g. ( |2.14C| )). 

(2.140) Did J.Adams inspect BA737 before/after UK160? 

(2.141) Did J.Adams inspect BA737 before/after he inspected UK160? 



Questions like (2.140) can be considered elliptical forms of Q2.141 ), i.e. in these cases 



"before" and "after" could be treated as when they introduce subordinate clauses (sec- 
tion 2.10.2| below). Questions like ( |2.140| ) are currently not supported by the framework 



of this thesis. 



Table 2.3 summarises the main points of this section. 



2.9.3 Duration "for ... " adverbials 

This section discusses "for ... " adverbials that specify durations (e.g. ( |2.142j )), 
(2.142) Runway 2 was open for five days. 
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With states and activities: When "for ... " adverbials combine with states or 
activities, one reading is that there must be a period with the duration of the "for ..." 
adverbial, such that the situation of the state or activity holds throughout that period. 
According to this reading, in ( p.l43j ) there must be a five-year period, throughout 
which the person was a manager, and in ( 2.143| ) a twenty-minute period throughout 



which the flight was circling. If J.Adams was a manager for six consecutive years (e.g. 
1981 - 1986), he would be included in the answer to ( |2.143f ), because there is a five-year 
period (e.g. 1981 - 1985) throughout which he was a manager. 

(2.143) Who was a manager for five years? 

(2.144) Did BA737 circle for twenty minutes? 

In some cases, however, "for ... " adverbials are used with a stricter meaning: they 
specify the duration of a maximal period where a situation held. In that case, if 
J.Adams started to be a manager at the beginning of 1981 and stopped being a manager 
at the end of 1986 (six consecutive years), he would not be included in the answer to 



( 2.143 ). For simplicity, this stricter reading is ignored in this thesis 



In other cases, a "for ..." adverbial does not necessarily specify the duration of a single 
period, but a total duration. According to this reading, if J.Adams was a manager 
during several non-overlapping periods, and the total duration of these periods is five 
years, he would be included in the answer to ( |2.143| ), even if he was never a manager 
for a continuous five-year period. This reading of "for" adverbials is also not supported 
in this thesis. 

There is a problem if "for ... " adverbials are allowed to combine with consequent 
states (section [2. 9.1 ). This problem will be discussed in section 4.11.2, once some 



formal apparatus has been established. For the moment, I note that the solution 
involves disallowing "for ... " adverbials to be used with consequent states. 

With points: "For ... " adverbials sometimes specify the duration of a situation 
that follows the situation of the verb. This is particularly common when "for ... " 
adverbials combine with point expressions. For instance, ( |2,145| ) (based on an example 
from | Hwang &: Schubert 94| 1), probably does not mean that J.Adams was actually 



leaving his office for fifteen minutes. It means that he stayed (or intended to stay) out 
of his office for fifteen minutes. (I assume here that "to leave" is a point verb, as in 
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the airport domain.) This use of "for ... " adverbials is not supported in this thesis. 

(2.145) J.Adams left his office for fifteen minutes. 



"For ... " adverbials also give rise to iterative readings (section [2.5.3 ), This is again 



particularly common with point expressions. ( El46| ) (from [pwang & Schubert 94|1 ) 
probably means that Mary won several times ( "to win" is typically classified as point 
verb). Such iterative uses of "for ... " adverbials are not supported in this thesis. 

(2.146) Mary won the competition for four years. 

Excluding iterative readings and readings where "for ... " adverbials refer to conse- 
quent situations (both are not supported in this thesis), sentences where "for ... " 
adverbials combine with point expressions either sound odd or signal that the user is 
unaware that the situation of the point expression is modelled as instantaneous (an ex- 
planatory message to the user is needed in the latter case; this thesis, however, provides 
no mechanism to generate such messages). Hence, for the purposes of this thesis it 
seems reasonable not to allow "for ... " adverbials to combine with point expressions. 

With culminating activities: When "for ... " adverbials combine with culminat- 
ing activities, the resulting sentences sometimes sound odd or even unacceptable. For 
example, ( 2.147| ) (based on an example from [ Moens 87 1) sounds odd or unacceptable 



to most native English speakers ( "to build" is typically classified as culminating activ- 
ity verb). In contrast, fl2.148| ) where the adverbial combines with a (progressive) state 
is easily acceptable. 

(2.147) ?Housecorp built a shopping centre for two years. 

(2.148) Housecorp was building a shopping centre for two years. 

Based on similar examples, Vendler (section ^^) concludes that accomplishments (cul- 
minating activities) do not combine with "for ... " adverbials. This, however, seems 
over-restrictive. @J4p and (pT50|) , for example, seem acceptable. 



(2.149) BA737 taxied to gate 2 for two minutes. 

(2.150) Did J.Adams inspect BA737 for ten minutes? 
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meanings of duration "for ... " adverbials 


with lexical or progressive state 


situation holds continuously for at least that long 


with consequent state 


(not allowed in the framework of this thesis) 


with activity 


situation holds continuously for at least that long 


with culminating activity 


situation holds continuously for at least that long 
(no need for climax to be reached) 


with point 


(not allowed in the framework of this thesis) 



Table 2.4: Duration "for ... " adverbials in the framework of this thesis 



Unlike ( [2.151| ), in ( 2.149 ) there is no requirement that the taxiing must have been 
completed, i.e. that BA737 must have reached the gate. Similar comments can be 
made for (|2l50|) and ( gl5g) . "For ..." adverbials seem to cancel any requirement 



that the climax must have been reached. (Similar observations are made in | Dowty 
[ Moens k Steedman 88] , and flKent 93| .) 



(2.151) BA737 taxied to gate 2. 

(2.152) Did J.Adams inspect BA737? 



In the framework of this thesis, I allow "for ..." adverbials to combine with culminat- 
ing activities, with the same meaning that I adopted in the case of states and activities, 
and with the proviso that any requirement that the climax must have been reached 
should be cancelled. That is, in ( |2Tl5Cl ) there must be a ten-minute period throughout 
which J.Adams was inspecting BA737. 



Table 2.4 summarises the main points of this section. 



2.9.4 Duration "in ... " adverbials 



This section discusses "in ..." adverbials that specify durations (e.g. ( 2.153 ), ( 2.154| )). 
"In" can also introduce period adverbials (e.g. "in 1995"; see section section 2.9.2[ ). 



(2.153) Airserve serviced BA737 in two hours. 

(2.154) Which flight did J.Adams inspect in one hour? 



With culminating activities: With culminating activity expressions, "in ... " 
adverbials usually specify the length of a period that ends at the time-point where the 
situation of the culminating activity expression is completed. In ( 2.153|) , for example, 
two hours is probably the length of a period that ends at the time-point where the 
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service was completed. Q2.154 ) is similar. The period whose length is specified by 



the "in ... " adverbial usually starts at the time-point where the situation of the 
culminating activity expression begins. In (2.153), for example, the two hours probably 



start at the time-point where the service began. The period of the adverbial, however, 
may sometimes not start at the beginning of the situation of the culminating activity 



expression, but at some other earlier time. In ( 2.153 ), the start of the two hours could 
be the time-point where Airserve was asked to service BA737, not the beginning of the 
actual service. The framework of this thesis supports only the case where the period 
of the adverbial starts at the beginning of the situation described by the culminating 
activity expression. 

With points: With point expressions, the period of the "in ... " adverbial starts 
before the (instantaneous) situation of the point expression, and ends at the time-point 



where the situation of the point expression occurs. In ( 2.155 ) the ten minutes end at 
the point where BA737 arrived at gate 2, and start at some earlier time-point (e.g. 
when BA737 started to taxi to gate 2). ( [2.156| ) is similar. 

(2.155) BA737 reached gate 2 in ten minutes. 

(2.156) BA737 entered sector 2 in five minutes. 

Determining exactly when the period of the adverbial starts is often difficult. It is not 
clear, for example, when the five minutes of fl2.156| ) start. As a simplification, I do not 
allow duration "in ... " adverbials to combine with point expressions. 

With states and activities: "In ... " adverbials are sometimes used with activity 
expressions, with the "in ... " duration adverbial intended to specify the duration of 
the situation described by the activity expression. Typically, in these cases the speaker 
has a culminating activity view in mind. For example, ( [2.157 ) can be used in this way 



if the speaker has a particular destination (say gate 2) in mind. In that case, ( |2.157| ) 
can be thought as an elliptical form of ( 2.158[) . The framework of this thesis does not 



support this use of (|2.157[) 



(2.157) BA737 taxied in ten minutes. 

(2.158) BA737 taxied to gate 2 in ten minutes. 
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meanings of duration "in ..." adverbials 


with state, activity, or point 


(not allowed in the framework of this thesis) 


with culminating activity 


distance from the start to the completion of the situation 



Table 2.5: Duration "in ... " adverbials in the framework of this thesis 

With state and activity expressions, "in ..." adverbials can also specify the duration of 
a period that ends at the beginning of the situation of the state or activity expression. 
In ( 2.159| ), for example, the two hours probably end at the time-point where tank 5 



started to be empty. The beginning of the two hours could be, for example, a time- 
point where a pump started to empty the tank, or a time-point where a decision to 
empty the tank was taken. Similar comments apply to fl2.157| ). 

(2.159) Tank 5 was empty in two hours. 



As with point expressions, determining exactly when the period of the adverbial starts 
is often difficult. As a simplification, I do not allow duration "in ... " adverbials to 
combine with state or activity expressions. 



Table 2.5 summarises the main points of this section. 



2.9.5 Other temporal adverbials 



Other temporal adverbials, that are not supported by the framework of this thesis, 
include some adverbials that specify boundaries (e.g. "until 1/5/95" , "since 1987", "by 
Monday"), frequency adverbials ("always", "twice", "every Monday"), and adverbials 
of temporal order ("for the second time", "earlier"). 



2.10 Temporal subordinate clauses 



Three kinds of temporal subordinate clauses are examined in this thesis: clauses in- 
troduced by "while", "before", and "after" (e.g. clauses introduced by "since", "un- 
til", or "when" are not examined). From the temporal subordinate clauses that are 
not examined, "when ... " clauses are generally considered the most difficult to sup- 
port (see [[Ritchie 7S|| , [|Yip 85|] , [[Hinrichs 86|] , |M pens 87] ], ] Moens fc Steedman 88| ], and 
[Lascarides & Oberlander 93] for explorations of "when ... " clauses). 
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2.10.1 "While... "clauses 



Subordinate clause: As with period adverbials (section |2.9.2| ), each "while ... " 
clause is understood as specifying a time period. This is a maximal period throughout 
which the situation of the "while ... " clause holds. Let us assume, for example, that 
J.Adams was a manager only from 1/1/1980 to 31/12/1983, and from 1/1/1987 to 
31/12/1990. Then, in ( |2.160| ) the period of the "while ..." clause can be either one 
of these two periods. The user may have in mind a particular one of the two periods. 
In that temporal anaphora resolution mechanism is needed to determine that 

period (temporal anaphora is discussed in section 2.12| ). The framework of this thesis, 



however, provides no such mechanism (the answer to ( |2.160| ) includes anybody who 
was fired during any of the two periods). 

(2.160) Who was fired while J.Adams was a manager? 



Sentences where the aspectual class of the "while ... " clause is point (e.g. (2.161) in 
the airport domain) typically signal that the user is unaware that the situation of the 
"while ... " clause is modelled as instantaneous. In the framework of this thesis, the 



answer to ( [2.161 ) includes any flight that was circling at the time-point where BA737 



entered sector 2. Ideally, a message would also be generated to warn the user that 
entering a sector is modelled as instantaneous (no warning is currently generated). 
This is another case where cooperative responses (section |l.4[) are needed. 

(2.161) Which flights were circling while BA737 entered sector 2? 

Sentences containing "while ... " clauses whose aspectual class is consequent state 
(section p. 9.1 ) usually sound unnatural or unacceptable. For example, ( |2.162| ) - ( |2.164| ) 



sound at least unnatural (e.g. instead of ( |2.162| ) one would normally use ( |2.165| ) or 
( 2.166| )). Hence, I do not allow "while ..." clauses whose aspectual class is consequent 



state. This also avoids some complications in the English to Top mapping. 

(2.162) ?Did any flight depart while BA737 had landed? 

(2.163) ?Did ABM fire anybody while J.Adams had been the manager? 

(2.164) ?Had any flight departed while J.Adams had inspected BA737? 

(2.165) Did any flight depart while BA737 was landing? 

(2.166) Did any flight depart after BA737 had landed? 
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O CTAfliTi' 11 A n rtnc AT 
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consequent state 


(not allowed in the framework of this thesis) 


lexical/progressive state 
or activity 


maximal period where situation of "while ... " clause holds 


culminating activity 


maximal period where situation of "while ... " clause holds 
(no need for climax of "while ... " clause to be reached) 


point 


instant, period where situation of "where ... " clause occurs 



Table 2.6: Periods of "while ... " clauses in the framework of this thesis 

When the aspectual class of the "while ... " clause is culminating activity, there is 
no requirement that the climax of the situation of the "while ... " clause must have 
been reached, even if the tense of that clause normally requires this. In ( |2~l68D and 



(2.171), for example, there does not seem to be any requirement that the service or 



the boarding must have been completed (cf. ( |2.167D and ( |2.170| )). fl2.168| ) and ( |2.171| ) 



appear to have the same meanings as ( 2.169| ) and ( |2.172[ ) (in progressive tenses, there 



is no requirement for the climax to be reached; see section |2.5.3| ). 



Table 2.6 summarises the main points about "while ... " clauses so far. 



(2.167) Did Airserve service BA737? 

(2.168) Which flights departed while Airserve serviced BA737? 

(2.169) Which flights departed while Airserve was servicing BA737? 

(2.170) Did BA737 board? 

(2.171) Which flights departed while BA737 boarded? 

(2.172) Which flights departed while BA737 was boarding? 



Main clause: Once the periods of the "while ... " clauses have been established 



(following table 2.6), the behaviour of "while ... " clauses appears to be the same 
as that of period adverbials (i.e. it follows table |2.3| on page 49). With main clauses 
whose aspectual class is point, the instantaneous situation of the main clause must 
occur within the period of the "while ... " clause (e.g. in (2.173) the departures must 
have occurred during a maximal period where runway 5 was closed; "to depart" is a 
point verb in the airport domain). 



(2.173) Did any flight depart from gate 2 while runway 5 was closed? 
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With activity main clauses, the situation of the main clause must be ongoing some time 
during the period of the "while ... " clause. In ( 2.1 74] ) , for example, the flights must 



have taxied some time during a maximal period where BA737 was circling. As with 
period adverbials, stricter readings are sometimes possible with activity main clauses. 



( 2.174 ), for example, could refer to flights that both started and stopped taxiing during 
a maximal period where BA737 was circling. As with period adverbials, I ignore such 
stricter readings. 

(2.174) Which flights taxied while BA737 circled? 

As in the case of period adverbials, with culminating activity main clauses I allow 
two readings: (a) that the situation of the main clause both starts and reaches its 
completion within the period of the "while ..." clause, or (b) that the situation of the 
main clause simply reaches its completion within the period of the "while ... " clause. 
In the second reading, the main clause is taken to refer to only the completion point of 
the situation it would normally describe, and its aspectual is changed to point. In the 
airport domain, the first reading is the preferred one in ( 2.175| ). The second reading 



allows the answer to ( [2.176| ) to contain flights that simply touched down during the 
service, even if their landing procedures did not start during the service. 

(2.175) J.Adams inspected BA737 while Airserve was servicing UK160. 

(2.176) Which flights landed while Airserve was servicing UK160? 

With state main clauses, I require the situation of the main clause to hold some time 



during the period of the "while ... " clause (inclusive reading; see section |2.9.2[) . 



For example, the answer to ( 2.177 ) must contain anybody who was a lecturer some 
time during a maximal period where J.Adams was a professor (the non-auxiliary "to 
be" is typically classified as state verb). As with period adverbials, there is often an 
implication that the situation of the main clause holds throughout the period of the 
"while ... " clause (durative reading) . The durative reading is unlikely in fl2.177j ) , 
but seems the preferred one in ( |2.178[) (progressive state main clause). According to 



the durative reading, ( 2.178 ) refers to a flight that was circling throughout a maximal 



period where runway 2 was closed. 



(2.177) Who was a lecturer while J.Adams was a professor? 

(2.178) Which flight was circling while runway 2 was closed? 
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The treatment of "while ... " clauses of this thesis is similar to that of Ritchie 79]. 
Ritchie also views "while ..." clauses as establishing periods, with the exact relations 
between these periods and the situations of the main clauses depending on the aspectual 
classes of the main clauses. Ritchie uses only two aspectual classes ("continuing" 
and "completed" ) , which makes presenting a direct comparison between his treatment 
of "while ... " clauses and the treatment of this thesis difficult. Both approaches, 
however, lead to similar results, with the following two main exceptions, (a) In ( 2.177| ) 



and (B.17SD (state main clause), Ritchie's treatment admits only durative readings. 
In contrast, the framework of this thesis admits only inclusive ones, (b) In dTftj ) 
(culminating activity main clause), Ritchie's arrangements allow only one reading, 
where the landings must have both started and been completed during the service. 
The framework of this thesis allows an additional reading, whereby it is enough if the 
landings were simply completed during the service. 



2.10.2 "Before ..." and "after ..." clauses 



I treat "before ... " and "after ... " clauses as establishing periods, as in the case of 



the "before ... " and "after ... " adverbials of section |2.9.2| . In "before ... " clauses, 
the period starts at some unspecified time-point (in the absence of other constraints, 
the beginning of time), and ends at a time-point provided by the "before ... " clause. 
In "after ... " clauses, the period starts at a time-point provided by the "after ... " 
clause, and ends at some unspecified time-point (the end of time, in the absence of 
other constraints). I use the terms before-point and after-point to refer to the time- 
points provided by "before ..." and "after ..." clauses respectively. Once the periods 
of the "before ... " and "after ... " clauses have been established, the behaviour of 



the clauses appears to be the same as that of period adverbials (i.e. it follows table ^3 
on page fl9|). 



State "before/after ... " clause: Let us first examine sentences where the aspec- 
tual class of the "before ... "or "after ..." clause is state. With "before ..." clauses, 
the before-point is a time-point where the situation of the "before ... " clause starts 
(table |2.7[) . In fl2.179| ), for example, the before-point is a time-point where runway 2 
started to be open. The aspectual class of the main clause is point ( "to depart" is 
a point verb in the airport domain). Hence, according to table |2.3| on page the 
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"before ... " clause 


(right boundary of period specified by "before ... " clause) 


state 


time-point where situation of "before ... " clause starts 


activity 


time-point where situation of "before ... " clause starts 


culm, activity 


time-point where situation of "before ... " clause 
starts or is completed 


point 


time-point where situation of "before ... " clause occurs 



Table 2.7: Boundaries of "before ... " clauses in the framework of this thesis 



departures must have occurred within the period of the "before ..." clause, i.e. before 
the time-point where runway 2 started to be open. Similar comments apply to ( |2.18C| ), 
( |2.181[ ) (progressive "before ... " clause), and ( p.!82j ) (consequent state "before ... " 



clause). In (2.182), the before-point is the beginning of the consequent period of the 



inspection (the period that contains all the time after the completion of the inspection; 



see section 2.9.1), i.e. the departures must have happened before the inspection was 
completed. 

(2.179) Which flights departed before runway 2 was open? 

(2.180) Which nights departed before the emergency system was in operation? 

(2.181) Which nights departed before BA737 was circling? 

(2.182) Which flights departed before J.Adams had inspected BA737? 



According to table 2.3 on page H, in ( fOgD where the main clause is a state, the flight 
must have been at gate 2 some time during the period of the "before ... " clause, i.e. 
for some time before runway 2 started to be open. In ( ETI84D (activity main clause), 
the flight must have circled for some time before runway 2 started to be open, and 



in (2.185) (culminating activity main clause) the inspections must have both started 
and been completed before runway 2 started to be open. (As with the "before ... " 
adverbials of section 2.9.2, in Q2.185 ) it would be better if the Nlitdb could also report 
inspections that started but were not completed before runway 2 opened, warning the 
user that these inspections were not completed before runway 2 opened.) 

(2.183) Was any flight at gate 2 before runway 2 was open? 

(2.184) Did any flight circle before runway 2 was open? 

(2.185) Which flights did J.Adams inspect before runway 2 was open? 
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asnpftiiril class of 
"before ..." clause 


after-point 

(left boundary of period specified by "after ... " clause) 


lexical/progressive state 


time-point where situation of "after ... " clause starts or ends 


consequent state 


time-point where consequent period of "after ... " clause starts 


activity 


time-point where situation of "after ... " clause ends 


culm, activity 


time-point where situation of "after ... " clause is completed 


point 


time-point where situation of "before ... " clause occurs 



Table 2.8: Boundaries of "after ... " clauses in the framework of this thesis 

In the case of "after ... " clauses, when the aspectual class of the "after ... " clause 
is state, the after-point is a time-point where the situation of the "after ... " clause 
either starts or ends. fl2.186j ), for example, has two readings: that the nights must have 
departed after runway 2 started to be open, or that the flights must have departed after 
runway 2 stopped being open. Similar comments apply to ( [2.1871 ) an d ( 2.188 ). 



(2.186) Which nights departed after runway 2 was open? 

(2.187) Which nights departed after the emergency system was in operation? 

(2.188) Which flights departed after BA737 was circling? 

In sentences like ( |2.189| ), where the aspectual class of the "after ... " clause is conse- 
quent state, the after-point can only be the beginning of the consequent period (the 
first time-point after the completion of the inspection). It cannot be the end of the 
consequent period: the end of the consequent period is the end of time; if the after- 
point were the end of the consequent period, the departures of ( 2.189Q would have to 



occur after the end of time, which is impossible. This explains the distinction between 
lexical/progressive and consequent states in table |2.8j. 



(2.189) Which nights departed after J.Adams had inspected BA737? 



Point "before/after ... " clause: If the aspectual class of the "before ... "or 
"after ... " clause is point, the before/after-point is the time-point where the in- 
stantaneous situation of the subordinate clause occurs. In ( [2.190| ), for example, the 
before/after-point is the point where BA737 reached gate2. 



(2.190) Which flights departed before/after BA737 reached gate 2? 
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Activity "before/after ... " clause: With activity "before/after ... " clauses, I 
consider the before-point to be a time-point where the situation of the "before ... " 
clause starts, and the after-point to be a point where the situation of the "after ... " 
clause ends. In ( p. 191 ) and ( |2.192| ), for example, the departures must have occurred 



before BA737 started to taxi or circle. In ( |2.193 ) and ( 2.194 ), the departures must 



have occurred after BA737 stopped taxiing or circling. 

(2.191) Which flights departed before BA737 taxied? 

(2.192) Which flights departed before BA737 circled? 

(2.193) Which flights departed after BA737 taxied? 

(2.194) Which flights departed after BA737 circled? 



Perhaps another reading is sometimes possible with "after ..." clauses: that the after- 
point is a time-point where the situation of the "after ... " clause starts (e.g. ( |2.194| ) 
would refer to departures that occurred after BA737 started to circle). This reading, 
however, does not seem very likely, and for simplicity I ignore it. 



Culminating activity "before/after ... " clause: With "after ... " clauses 
whose aspectual class is culminating activity, I consider the after-point to be a time- 
point where the situation of the "after ... " clause reaches its completion. In ( |2.195| ), 
the departures must have occurred after the completion of the inspection, and in ( |2.196| ) 
they must have occurred after the time-point where BA737 reached gate 2. 

(2.195) Which flights departed after J.Adams inspected BA737? 

(2.196) Which flights departed after BA737 taxied to gate 2? 



With culminating activity "before ..." clauses, I allow the before-point to be a time- 
point where the situation of the "before ... " clause either starts or reaches its com- 
pletion. In the airport domain, the first reading seems the preferred one in ( |2.197| ) 
(the flights must have departed before the beginning of the inspection). The second 
reading seems the preferred one in ( 2.199| ) (the flights must have departed before the 



completion of the landing). Both readings seems possible in ( 2.199 ). 



(2.197) Which flights departed before J.Adams inspected BA737? 

(2.198) Which flights departed before BA737 landed? 



CHAPTER 2. THE LINGUISTIC DATA AND AN INFORMAL ACCOUNT 62 
(2.199) Which flights departed before BA737 taxied to gate 2? 



If the first reading is adopted (the situation of the "before ... " clause starts at the 
before-point) and the "before ... " clause is in the simple past, it is unclear if the 
situation of the "before ... " clause must have necessarily reached its climax (the 
simple past of culminating activity verbs normally requires this; see section 2.5.2| ). For 



example, let us assume that the first reading is adopted in ( [2,197| ). Should the before- 
point be the beginning of an inspection that was necessarily completed, or can it also 
be the beginning of an inspection that was never completed? The framework of this 
thesis currently adopts the first approach, but this is perhaps over-restrictive. It would 
probably be better if the Nlitdb allowed the before-point to be the beginning of both 
inspections that were and were not completed, warning the user about inspections that 
were not completed. This is another case for cooperative responses (section |L 



Other uses: "Before" and "after" can be preceded by expressions specifying dura- 
tions (e.g. (|2.200|) ). This use of "before" and "after" is not considered in this thesis. 



(2.200) BA737 reached gate 2 five minutes after UK160 departed. 



"Before ... " clauses also have counter-factual uses. For example, in ( 2.201| ) (from 



flCrouch "91 1 ) the situation where the car runs into the tree never takes place. This use 



of "before" is not considered in this thesis. 

(2.201) Smith stopped the car before it ran into the tree. 

The treatment of "before ..." and "after ..." clauses of this thesis is similar to that of 
[ Ritchie 79[ . Ritchie also views "before ..." and "after ..." clauses as providing before 



and after-points. As noted in section 2.10.1, however, Ritchie uses only two aspectual 



classes. According to Ritchie, in the case of "before ... " clauses, the before-point is 
a time-point where the situation of the "before ... " clause starts, and the situation 
of the main clause must simply start before that point. In ( |2.185| ), this requires the 
inspections to have simply started before the time-point where runway 2 started to be 
open. In contrast, the framework of this thesis requires the inspections to have been 
completed before that time-point. 

In the case of "after ... " clauses, the main difference between Ritchie's treatment and 
the treatment of this thesis concerns state "after ... " clauses. In that case, Ritchie 
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allows the after-point to be only the beginning of the situation of the "after ... " 
clause. In ([U87]), this requires the flights to have departed after the time-point where 
the system started to be in operation. The framework of this thesis allows an additional 
reading, where the flights must have departed after the time-point where the system 
stopped being in operation. 



2.10.3 Tense coordination 



Some combinations of tenses in the main and subordinate clauses are unacceptable (e.g. 
( 2.202[ ), Q2.203D ). This thesis makes no attempt to account for the unacceptability of 



such combinations. The reader is referred to | ]Harper &: Charniak 86| j and [ |Brent 90 ] 



for methods that could be used to detect and reject sentences like ( 2.202| ) and ( |2.203] ). 



(2.202) *BA737 left gate 2 before runway 2 is free. 

(2.203) * Which runways are closed while runway 2 was circling? 



2.11 Noun phrases and temporal reference 



A question like ( |2.204 ) can refer either to the present sales manager (asking the 1991 



salary of the present sales manager) or to the 1991 sales manager (asking the 1991 salary 



of the 1991 sales manager). Similarly, ( 2.205 ) may refer either to present students or 



last year's students. In ( 2.206|) , "which closed runway" probably refers to a runway 



that is currently closed, while in ( |2.207| ) "a closed runway" probably refers to a runway 
that was closed at the time of the landing. 

(2.204) What was the salary of the sales manager in 1991? 

(2.205) Which students failed in physics last year? 

(2.206) Which closed runway was open yesterday? 

(2.207) Did BA737 ever land on a closed runway in 1991? 



It seems that noun phrases (e.g. "the sales manager", "which students", "a closed 
runway") generally refer either to the present or to the time of the verb tense (if this 
time is different than the present). In ( [2.204| ), the simple past tense refers to some 
time in 1991. Therefore, there are two options: "the sales manager" can refer either to 
the present sales manager or to somebody who was the sales manager in 1991. Similar 
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comments apply to ( p.205| ). In contrast, in ( [2.208 ) the verb tense refers to the present. 



Hence, there is only one possibility: "the sales manager" refers to the present sales 
manager. 

(2.208) What is the salary of the sales manager? 

In ( [2 .206] ), the verb tense refers to a time (within the previous day) where the runway 
was open. There should be two readings: it should be possible for "which closed 
runway" to refer either to a currently closed runway, or to a runway that was closed 
at the time it was open. Since a runway cannot be closed at the same time where it is 
open, the second reading is ruled out. (This clash, however, cannot be spotted easily 
by a Nlitdb without some inferential capability.) 

The hypothesis that noun phrases refer either to the present or to the time of the verb 
tense is not always adequate. For example, a person submitting ( 2.209D to the Nlitdb 



of a university most probably refers to previous students of the university. In contrast, 
the hypothesis predicts that the question can refer only to current students. (Similar 
examples can be found in [ Enc 86| ].) 



(2.209) How many of our students are now professors? 



The hypothesis also predicts that ( 2.210| ) can refer only to current Prime Ministers 



or to persons that were Prime Ministers at the time they were born (an extremely 
unlikely reading). There is, however, a reading where the question refers to all past 
and present Prime Ministers. This reading is incorrectly ruled out by the hypothesis. 

(2.210) Which Prime Ministers were born in Scotland? 



Hinrichs [Hinrichs 85] argues that determining the times to which noun phrases refer 
is part of a more general problem of determining the entities to which noun phrases 
refer. According to Hinrichs, a noun phrase like "every admiral" generally refers to 
anybody who was, is, or will be an admiral of any fleet in the world at any time. If, 
however, ( 2.21l| ) is uttered in a context where the current personnel of the U.S. Pacific 



fleet is being discussed, the temporal scope of "every admiral" is restricted to current 
admirals, in the same way that the scope of "every admiral" is restricted to admirals 



of the U.S. Pacific fleet (e.g. ( 2.211 ) does not mean that all Russian admirals also 
graduated from Annapolis). 
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(2.211) Every admiral graduated from Annapolis. 



The fact that Hinrichs does not limit the times of the noun phrases to the present 
and the time of the verb tense is in accordance with the fact that "our students" in 



( 2.209 ) is not limited to present students, and the fact that "which Prime Ministers" 



m may refer to all past and present Prime Ministers. Hinrichs' approach, 



however, requires some mechanism to restrict the scope of noun phrases as the discourse 
evolves. Hinrichs offers only a very limited sketch of how such a mechanism could be 
constructed. Also, in the absence of previous discourse, Hinrichs' treatment suggests 



that (2.204) refers to the sales managers of all times, an unlikely interpretation. The 
hypothesis that noun phrases refer either to the present or to the time of the verb tense 
performs better in this case. Given these deficiencies of Hinrichs' approach, I adopt 
the initial hypothesis that noun phrases refer to the present or the time of the verb 
tense. (An alternative approach would be to attempt to merge this hypothesis with 
Hinrichs' method, [ palrymple 8§| ] goes towards this direction.) 

A further improvement can be made to the hypothesis that noun phrases refer to the 
present or the time of the verb tense. When a noun phrase is the complement of 
the predicative "to be", it seems that the noun phrase can refer only to the time of 
the verb tense. (2.212), for example, can only be a request to report the 1991 sales 



manager, not the current sales manager. Similarly, ( 2.213| ) cannot mean that J.Adams 



is the current sales manager. This also accounts for the fact that in ( p.204 ), unlike 



"the saJes manager" which can refer either to the present or 1991, "the salary of the 
sales manager" (the complement of "was") can refer only to a 1991 salary, not to a 
present salary. (I assume that the restriction that the complement of the predicative 
"to be" must refer to the time of the verb tense does not extend to noun phrases that 
are subconstituents of that complement, like "the sales manager" in Q2.204 ).) The 



same restriction applies to bare adjectives used as complements of the predicative "to 
be". In ( 2.206| ), for example, "open" can only refer to runways that were open on the 



previous day. It cannot refer to currently open runways. 

(2.212) Who was the sales manager in 1991? 

(2.213) J.Adams was the sales manager in 1991. 

The hypothesis that noun phrases refer to the present or the time of the verb tense 
does not apply when a temporal adjective (e.g. "current") specifies explicitly the time 
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of the noun phrase (e.g. ( 2.214p ). (Although temporal adjectives are not considered in 



this thesis, I support "current" to be able to illustrate this point.) 
(2.214) Which current students failed in Physics last year? 

In chapter |||, an additional mechanism will be introduced, that allows the person 
configuring the Nlitdb to force some noun phrases to be treated as always referring 
to the time of the verb tense, or as always referring to the present. 

2.12 Temporal anaphora 

There are several English expressions (e.g. "that time", "the following day", "then", 
"later") that refer implicitly to contextually salient times, in a way that is similar to 
how pronouns, possessive determiners, etc. refer to contextually salient world entities 
(the terms temporal and nominal anaphora were used in section |L4| to refer to these 
two phenomena; the parallels between temporal and nominal anaphora are discussed 



in [Partee 84|). For example, the user of a Nlitdb may submit (2.215), followed by 



|2.216[ ). In (2.216), "at that time" refers to the time when John became manager 



(temporal anaphora). In a similar manner, "he" refers to John (nominal anaphora). 

(2.215) When did John become manager? 

(2.216) Was he married at that time? 

Names of months, days, etc. often have a similar temporal anaphoric nature. For 
example, in a context where several questions about the 1990 status of a company 
have just been asked, ( 2.217| ) most probably refers to the January of 1990, not any 



other January. In the absence of previous questions, (2.217) most probably refers to 



the January of the current year. (See section 5.5.1 of [Kamp & Reyle 93| for related 
discussion.) 

(2.217) Who was the sales manager in January? 

Verb tenses also seem to have a temporal anaphoric nature (the term tense anaphora is 
often used in this case). For example, the user may ask ( |2,218| ) (let us assume that the 
response is "no "), followed by (|2.219|) . In that case, the simple past "was " of ( gjjj ) 



does not refer to an arbitrary past time, it refers to the past time of the previous 
question, i.e. 1993. 
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(2.218) Was Mary the personnel manager in 1993? 

(2.219) Who was the personnel manager? 

The anaphoric nature of verb tenses is clearer in multi-sentence text (see [|Hinr ichs 86fl , 
| Webber 88(1 , jKamp fc Reyle 93| , |Kameyama et al. 93| for related work). In ( |2.22q ), 



for example, the simple past "landed" refers to a landing that happened immediately 
after the permission of the first sentence was given. It does not refer to an arbitrary 
past time where BA737 landed on runway 2. Similar comments apply to the "taxied". 

(2.220) BA737 was given permission to land at 5:00pm. It landed on runway 2, and 
taxied to gate 4. 



In dialogues like the one in ( |2.218| ) - ( 2.219| ), a simplistic treatment of tense anaphora 



is to store the time of the adverbial of ( 2.218| ), and to require the simple past of ( 2.219j ) 



to refer to that time. (A more elaborate version of this approach will be discussed in 
section 



3.17 



The behaviour of noun phrases like "the sales manager" of section p. 11 can be seen 



as a case of temporal anaphora. This is the only type of temporal anaphora that 
is supported by the framework of this thesis. Expressions like "at that time", "the 
following day", etc. are not supported, and tenses referring to the past are taken to 
refer to any past time. For example, ( 2.219| ) is taken to refer to anybody who was the 



personnel manager at any past time. The reader is also reminded (section |l.4| ) that 
nominal anaphora is not considered in this thesis. 

2.13 Other phenomena that are not supported 

This section discusses some further phenomena that are not supported by the frame- 
work of this thesis. 



Cardinality and duration questions: Questions about the cardinality of a set or 
the duration of a situation (e.g. ( 2.221 ), ( |2.222j )) are not supported. (Top is currently 
not powerful enough to express the meanings of these questions.) 



(2.221) How many flights have landed today? 

(2.222) For how long was tank 2 empty? 
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Cardinalities and plurals: Expressions specifying cardinalities of sets (e.g. "eight 
passengers" , "two airplanes") are not supported (this does not include duration expres- 
sions like "Eve hours", which are supported). Expressions of this kind give rise to a dis- 
tinction between distributive and collective readings [Stirling 85] [Crouch & Pulman 93] 



( 2.223 ), for example, has a collective reading where the eight passengers arrive at the 
same time, and a distributive one where there are eight separate arrivals. This dis- 
tinction was not explored during the work of this thesis. Top is also currently not 
powerful enough to express cardinalities of sets. 

(2.223) Eight passengers arrived. 

The framework of this thesis accepts plural noun phrases introduced by "some" and 
"which" (e.g. "some Eights", "which passengers"), but it treats them semantically as 
singular. For example, (2.224) and (2.226) are treated as having the same meanings 
as ( gggg ) and ( g227| ) respectively. 



(2.224) Which flights landed? 

(2.225) Which flight landed? 

(2.226) Some flights entered sector 2. 

(2.227) A flight entered sector 2. 



Quantifiers: Expressions introducing universal quantifiers at the logical level (e.g. 
"every", "all") are not supported. This leaves only existential quantifiers (and an 
interrogative version of them, to be discussed in chapter ||) at the logical level, avoid- 
ing issues related to quantifier scoping (see also section |6.6.2 ). It also simplifies the 
semantics of Top and the mapping from Top to Tsql2. 



Conjunction, disjunction, and negation: Conjunctions of words or phrases are 
not supported. Among other things, this avoids phenomena related to sequencing 
of events. For example, (|2.228|) is understood as saying that the patient died after 
(and probably as a result of) being given Qdrug (cf. fl2.229j ) which sounds odd). In 
contrast, in ( 2.230| ) the patient was given Qdrug while he had high fever. (See, for 
example, [Hinrichs 86], [Hinrichs 88], [ Webber 88| , |Kamp &; Reyle 93], [terMeulen 94], 



and [Hwang & Schubert 94] for related work.) 
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(2.228) Which patient was given Qdrug and died? 

(2.229) ? Which patient died and was given Qdrug? 

(2.230) Which patient had high fever and was given Qdrug? 

Expressions introducing disjunction or negation (e.g. "or", "either", "not", "never") 
are also not supported. This simplifies the semantics of Top and the Top to Tsql2 
mapping. Not supporting negation also avoids various temporal phenomena related 
to negation (see section 5.2.5 of [ Kamp &; Reyle 93fl ), and claims that negation causes 



aspectual shifts (see, for example, [ Dowty 8q| and flMoens 8711 ). 



Relative clauses: Relative clauses are also not supported. Relative clauses require 
special temporal treatment. (2.231), for example, most probably does not refer to a 



runway that was closed at an arbitrary past time; it probably refers to a runway that 
was closed at the time of the landing. The relation between the time of the relative 
clause and that of the main clause can vary. In ( 2.232| ) (from [ Dowty 8q ]), for example, 



the woman may have seen John during, before, or even after the stealing. 

(2.231) Which flight landed on a runway that was closed? 

(2.232) The woman that stole the book saw John. 

Relative clauses can also be used with nouns that refer to the temporal ontology (e.g. 
"period" in ( |2.233| )). Additional temporal phenomena involving relative clauses are 
discussed in section 5.5.4.2 of [ Kamp fc Reyle 9"^] . 



(2.233) Who was fired during the period that J.Adams was personnel manager? 

Passives: Finally, I have concentrated on active voice verb forms. This simplifies the 
Hpsg grammar of chapter 4. It should be easy to extend the framework of this thesis 
to cover passive forms as well. 

2.14 Summary 



The framework of this thesis uses an aspectual taxonomy of four classes (states, 
points, activities, and culminating activities). This taxonomy classifies verb forms, 
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verb phrases, clauses, and sentences. Whenever the Nlitdb is configured for a new 
application, the base form of each verb is assigned to one of the four aspectual classes. 
All other verb forms normally inherit the aspectual class of the base form. Verb 
phrases, clauses, and sentences normally inherit the aspectual classes of their main 
verb forms. Some linguistic mechanisms (e.g. progressive tenses, or some temporal 
adverbials), however, may cause the aspectual class of a verb form to differ from that 
of the base form, or the aspectual class of a verb phrase, clause, or sentence to differ 
from that of its main verb form. The aspectual taxonomy plays an important role in 
most time-related linguistic phenomena. 

Six tenses (simple present, simple past, present continuous, past continuous, present 
perfect, and past perfect) are supported, with various simplifications introduced in their 
meanings. Some special temporal verbs were identified (e.g. "to happen", "to start"); 
from these only "to start", "to begin", "to stop", and "to finish" are supported. 

Some nouns have a special temporal nature. For example, some introduce situations 
(e.g. "inspection"), others specify temporal order (e.g. "predecessor"), and others refer 
to entities of the temporal ontology (e.g. "day", "period", "event"). From all these, only 
nouns like "year", "month", "day", etc. (and proper names like "Monday", "January", 
and "1/5/92") are supported. Nouns referring to more abstract temporal entities (e.g. 
"period", "event") are not supported. No temporal adjectives (e.g. "first", "earliest") 
are handled, with the only exception of "current" which is supported to demonstrate 
the anaphoric behaviour of some noun phrases. 

Among temporal adverbials, only punctual adverbials (e.g. "at 5:00pm"), "for... "and 
"in ..." duration adverbials, and period adverbials introduced by "on", "in", "before", 
or "after", as well as "today" and "yesterday" are handled. Frequency, order, or other 
adverbials that specify boundaries (e.g. "twice", "for the second time", "since 1992") 
are not supported. 

Only subordinate clauses introduced by "while", "before", and "after" are handled 
(e.g. clauses introduced by "when" or "since" and relative clauses are not supported). 
The issue of tense coordination between main and subordinate clauses is ignored. 

Among temporal anaphora phenomena, only the temporal anaphoric nature of noun 
phrases like "the sales manager" is supported. Proper names like "May" or "Monday" 
are taken to refer to any May or Monday. Similarly, past tenses are treated as referring 
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to any past time. Temporal anaphoric expressions like "that time" or "the following 
day" are not allowed. (Nominal anaphoric expressions, e.g. "he", "her salary", are also 
not allowed.) 

The framework of this thesis does not support cardinality or duration queries ( "How 
many... ?", "How long .. . ?") and cardinality expressions (e.g. "Eve flights") . Plurals 
introduced by "which" and "some" (e.g. "which flights", "some gates") are treated 
semantically as singular. Conjunctions of words or phrases, and expressions introducing 
universal quantifiers, disjunction, or negation are also not supported. Finally, only 
active voice verb forms have been considered, though it should be easy to extend the 
mechanisms of this thesis to support passive voice as well. 



Table 2.9 summarises the linguistic coverage of the framework of this thesis. 



CHAPTER 2. THE LINGUISTIC DATA AND AN INFORMAL ACCOUNT 72 



verb tenses 


y/ simple present (excluding scheduled meaning) 
sj simple past 

y/ present continuous (excluding futurate meaning) 
\J past continuous (excluding futurate meaning) 
xj present perfect (treated as simple past) 
yj past perfect 
x other tenses 


temporal verbs 


yj "to start", "to begin", "to stop", "to finish" 

x other temporal verbs (e.g. "to happen", "to follow") 


temporal nouns 


yj "year", "month", "day", etc. 

x "period", "event", "time", etc. 

x nouns introducing situations (e.g. "inspection") 

x nouns of temporal order (e.g. "predecessor") 


temporal adjectives 


x (only "current") 


temporal adverbials 


yj punctual adverbials (e.g. at 5:U(Jpm ) 

yj period adverbials (only those introduced by "on", "in", 

"before", or "after", and "today", "yesterday") 
yj "for ..." adverbials 

yj "in ... " duration adverbials (only with culm. act. verbs) 

x frequency adverbials (e.g. "twice") 

x order adverbials (e.g. "for the second time") 

x other boundary adverbials (e.g. "since 1987") 


subordinate clauses 


yj "while ... " clauses 
yj "before ..." clauses 
yj "after ..." clauses 
x relative clauses 

x other subordinate clauses (e.g. introduced by "when") 
x tense coordination between main-subordinate clauses 


anaphora 


yj noun phrases and temporal reference 
x "January", "August", etc. 

(taken to refer to any January, August, etc.) 
x tense anaphora 

(past tenses taken to refer to any past time) 
x "that time", "the following day", etc. 
x nominal anaphora (e.g. "he", "her salary") 


other phenomena 


x cardinality and duration queries 

( "How many . . . ?", "How long . . . ?") 

x cardinality expressions (e.g. "five Eights") 

x plurals (treated as singulars) 

x conjunctions of words or phrases 

x expressions introducing universal quantifiers, 
disjunction, negation 

x passive voice 



Table 2.9: The linguistic coverage of the framework of this thesis 



Chapter 3 

The TOP Language 



"Time will tell." 

3.1 Introduction 

This chapter defines Top, the intermediate representation language of this thesis. As 
noted in section 1.2.3; , Top employs temporal operators. ([3.1|), for example, is repre- 



sented in Top as ( |3.2| ). Roughly speaking, the Past operator requires contain(tank2, water) 
to be true at some past time e v , and the At operator requires that time to fall within 
1/10/95. The answer to (3.1) is affirmative iff (|3.2|) evaluates to true. 



(3.1) Did tank 2 contain water (some time) on 1/10/95? 

(3.2) At[l 1 10 1 95 , Past[e v ,contain(tank2, water)]] 

An alternative operator-less approach is to introduce time as an extra argument of 



each predicate (section 1.2.3 ). I use temporal operators because they lead to more 
compact formulae, and because they make the semantic contribution of each linguistic 
mechanism easier to see (in (3.2), the simple past tense contributes the Past operator, 



while the "on ... " adverbial contributes the At operator). 

Top is period-based, in the sense that the truth of a Top formula is checked with 
respect to a time-period (a segment of the time-axis) rather than an individual time- 
point. (The term "period" is used here to refer to what other authors call "intervals"; 
see section [T^ below.) A Top formula may be true at a time-period without being 
true at the subperiods of that period. Actually, following the Reichenbachian tradition 
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| Rcichenbach 47| , Top formulae are evaluated with respect to more than one times: 
speech time (time at which the question is submitted), event time (time where the 
situation described by the formula holds), and localisation time (a temporal window 
within which the event time must be placed; this is different from Reichenbach's ref- 
erence time, and similar to the "location time" of [ Kamp fe Rcylc 9^ 1). While speech 
time is always a time-point, the event and localisation times are generally periods, 
and this is why I consider Top period-based. Period-based languages have been used 
in flDowty 82| , jAUen 84|1 , RLascarides 88| , jRichards et al. 89|1 , [[Pratt fc Bree 95|| , and 
elsewhere. Multiple temporal parameters have been used by several researchers (e.g. 
[ Dowty ||] , flHinrichs 88| , HBrent 90fl , jCrouch k Pulman 93| ). The term "localisation 



time" is borrowed from | Crouch fc Pulman 93 1, where It is a temporal window for et 
as in Top. 

Although the aspectual classes of linguistic expressions affect how these expressions 
are represented in Top, it is not always possible to tell the aspectual class of a lin- 
guistic expression by examining the corresponding Top formula. The approach here is 
different from those of [Dowty 77], [Dowty 86], | Lascarides 88}| , and |Kent 93| 1, where 
aspectual class is a property of formulae (or denotations of formulae). 



Top was greatly influenced by the representation language of Pirie et al. [ Pirie et al. 9C ] 
jCrouch 91f jCrouch h Pulman 93| , that was used in a natural language front-end to 
a planner. Top, however, differs in numerous ways from the language of Pirie et al. 
(several of these differences will be mentioned in following sections). 



3.2 Syntax of TOP 



This section defines the syntax of Top. Some informal comments about the semantics 
of the language are also given to make the syntax definition easier to follow. The 
semantics of Top will be defined formally in following sections. 



Terms: Two disjoint sets of strings, CONS (constants) and VARS (variables), are 
assumed. I use the suffix ilv " to distinguish variables from constants. For example, 
runway v ,gatel v G VARS, while ba737, 1/5/94 6 CONS. TERMS (Top terms) is 
the set CONS U VARS. (Top has no function symbols.) 
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Predicate functors: A set of strings PFUNS is assumed, 
predicate functors (see atomic formulae below). 
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These strings are used as 



Complete partitioning names: A set of strings CPARTS is assumed. These 
strings represent complete partitionings of the time-axis. A complete partitioning of 
the time-axis is a set of consecutive non-overlapping periods, such that the union of 
all the periods covers the whole time-axis. (A formal definition will be given in section 
|3.4| .) For example, the word "day" corresponds to the complete partitioning that con- 
tains the period that covers exactly the day 13/10/94, the period that covers exactly 
14/10/94, etc. No day-period overlaps another one, and together all the day-periods 
cover the whole time-axis. Similarly, "month" corresponds to the partitioning that 
contains the period for October 1994, the period for November 1994, etc. I use the 
suffix " c " for elements of CPARTS. For example, day could represent the partitioning 
of day-periods, and month the partitioning of month-periods. 

Gappy partitioning names: A set of strings GPARTS is assumed. These strings 
represent gappy partitionings of the time- axis. A gappy partitioning of the time- axis 
is a set of non-overlapping periods, such that the union of all the periods does not 
cover the whole time-axis. For example, "Monday" corresponds to the gappy parti- 
tioning that contains the period which covers exactly the Monday 17/10/94, the period 
that covers exactly the Monday 24/10/94, etc. No Monday-period overlaps another 
Monday-pariod, and all the Monday-periods together do not cover the whole time-axis. 
I use the suffix " 9 " for elements of GPARTS. For example, monday 9 could represent 
the partitioning of Monday-periods, and 5:00pm 9 the partitioning of all 5:00pm-periods 
(the period that covers exactly the 5:00pm minute of 24/10/94, the period that covers 
the 5:00pm minute of 25/10/94, etc.). 

Partitioning names: PARTS (partitioning names) is the set CPARTS U GPARTS. 

Atomic formulae: AFORMS (atomic formulae) is the smallest possible set, such 
that: 

• If 7T € PFUNS, and n, r 2 , . . . , r n G TERMS, then 7r(n, r 2 , . . . , r„) € AFORMS. 
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7r ( r i) T 2i ■ ■ ■ , T n ) is called a predicate. t±, t%, . . . , r n are the arguments of the pred- 
icate. 

• Ifcr G PARTS, (3 G VARS, &ndv ord G {...,-3,-2,-1,0}, then Part [a, (3, u ord ] G 
AFORMS and Part[cr,0] G AFORMS. 

Greek letters are used as meta- variables, i.e. they stand for expressions of Top. Pred- 
icates (e.g. 6e_at(6a737 , gate")) describe situations in the world. Part [a, [3, v rd\ means 
that (3 is a period in the partitioning a. The v ord is used to select a particular period 
from the partitioning. If v or d = 0, then (3 is the current period of the partitioning (the 
one that contains the present moment). If v ord < 0, then (3 is the — fo^-th period of 
the partitioning before the current one. When there is no need to select a particular 
period from a partitioning, the Part[a, f3] form is used. 

Yes/no formulae: Yes/no formulae represent questions that are to be answered 
with a "yes" or "no" (e.g. "7s BA737 circling?"). YNFORMS is the set of all yes/no 
formulae. It is the smallest possible set, such that if it G PFUNS, t\, . . . , r n G TERMS, 
(f), 0i, <fe G FORMS, a c G CPARTS, u qty G {1, 2, 3, ...},/? is a Top variable that does 
not occur in <ft, and r is a Top variable that does not occur in (j) or a Top constant, 
all the following hold. (The restriction that (3 and r must not be variables that occur 
in (j) is needed in the translation from TOP to TSQL2 of chapter ||.) 

• AFORMS C YNFORMS 

• 0i A 02 G YNFORMS 

• Pres[0], Port [0,0], Per/[/3,0] G YNFORMS 

• Ai[r,0], Ai[0i,0 2 ] G YNFORMS 

• Pe/ore[r,0], Pe/ore[0i, 2 ], After [r,0], A/ter[0i,0 2 ] G YNFORMS 

• Mense[/3,0], Ntense[now*, 0] G YNFORMS 

• For[<7 c ,i/ gt2/ ,0], Fa7Zs[0] G YNFORMS 

• Begin[cj)}, End[(p] G YNFORMS 

• CWm[7r(ri, . . . , r«)] G YNFORMS 
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No negation and disjunction connectives are denned, because English expressions in- 
troducing these connectives are not considered (section p. 13 ). For the same reason 



no universal quantifiers are defined. All variables can be thought of as existentially 
quantified. Hence, no explicit existential quantifier is needed. 

An informal explanation of Top's operators follows (Top's semantics will be defined 
formally in following sections). Pres[(p] means that (j) is true at the present. For 
example, "Runway 2 is open." is represented as Pres[open(runway2)]. Past[f3,4>] 
means that cj> is true at some past time (3. The Perf operator is used along with the 
Past operator to express the past perfect. For example, "Runway 2 was open." is 
represented as Past[e v , open(runway2)], and "Runway 2 had been open." as: 

Past[el v , Perf[e2 v ,open(runway2)]] 



At[r, (j)] means that 4> holds some time within a period r, and At[<f>i, 02 ] means that 02 
holds at some time where <pi holds. For example, "Runway 2 was open (some time) on 
1/1/94." is represented as At[l /l / 94 , Past [e v , open(runway2)]], and "Runway 2 was 
open (some time) while BA737 was circling." as: 

At[Past[el v , circling(ba737)\, Past[e2 v , open(runway2)}] 



Before[r,<j)] means that is true at some time before a period r, and Before[4>i, 02] 
means that 02 is true at some time before a time where 0i is true. After[r, 0] and 
After[4>i, (j>2\ have similar meanings. For example, "Tank 2 was empty (some time) 
after 1/1/92." is represented as After[l / 1 / 92 , Past[e v , empty (tank2)]\, and "Tanic 2 
was empty (some time) before the bomb exploded." as: 

Before[Past[el v , explode(bomb)], Past[e2 v , empty (tank2)]] 



Ntense is used when expressing noun phrases (see section 2.11| ). Ntense[B, 0] means 
that at a time (3 something has the property specified by <j). Ntense[now* , (j)} means that 
something has the property specified by 4> at the present. The reading of "The president 
was visiting Edinburgh." that refers to the person who was the president during the 
visit is represented as Ntense[eV ,president(p v )] A Past[eV , visiting(p v , edinburgh)]. 
In contrast, the reading that refers to the current president is represented as: 



Ntense[now* ,president(p v )] A Past[el v ,visiting(p v , edinburgh)} 



CHAPTER 3. THE TOP LANGUAGE 78 

For[a c ,i/qty,(/>] means that cj> holds throughout a period that is v qty <r c -periods long. 
"Runway 2 was open for two days." is represented as: 

For[day c , 2, Past[e v , open(runway2)]] 

The Fills operator is currently not used in the framework of this thesis, but it could 
be used to capture readings of sentences like "Tank 2 was empty in 1992." whereby 
the situation of the verb holds throughout the period of the adverbial (see section 
2.9.2| ). At[l992, Past[e v , Fills[empty(tank2)}}} means that the tank was empty through- 



out 1992, while ^4i[1992, Past [e v , empty (tank2)]] means that the tank was empty some 
time in 1992, but not necessarily throughout 1992. 

Begin[(j)] means that <j) starts to hold, and End[(f)] means that <f> stops holding. For ex- 
ample, "BA737 started to land." can be represented as Past[e v , Begin[landing(ba737)]], 
and "Tank 2 stopped being empty." as Past[e v , End[empty(tank2)]]. 

Finally, Culm is used to represent sentences where verbs whose base forms are cul- 
minating activities appear in tenses that require some inherent climax to have been 



reached. The Culm operator will be discussed in section 3.9. 



Wh-formulae: Wh-formulae are used to represent questions that contain interrog- 
atives (e.g. "Which . . . ?", "Who . . . ?", "When . . . "?). WHFORMS is the set of all 
wh-formulae. WHFORMS d = WHFORMS! U WHFORMS 2 , where: 

• WHFORMSi is the set of all expressions of the form 1p\ ?/?2 • • • 4>, where 
Pi, /?2, • • • , n £ VARS, 4> £ YNFORMS, and each one of Pi, P2, ■ ■ ■ , P n occurs at 
least once within <f>. 

• WHFORMS2 is the set of all expressions of the form ? m xiPi ?/?2 ?/?3 • • • tp n (f), 
where fa, fa, Ps, ■ ■ ■ , Pn e VARS, <f) G YNFORMS, each one of p 2 , Pa, ■ ■ ■ , Pn 
occurs at least once within <f>, and Pi occurs at least once within (j) as the first 
argument of a Past, Perf, At, Before, After, or Ntense operator, or as the second 
argument of a Part operator. 

"?" is the interrogative quantifier, and l mx i the interrogative-maximal quantifier. The 
interrogative quantifier is similar to an explicit existential quantifier, but it has the 
additional effect of reporting the values of its variables that satisfy its scope. Intuitively, 
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7^2 t/3 n (ft means "report all @i, 02, ■ ■ ■ , Pn such that (ft". For example, "Which 
runways are open?" is represented as ?r v Ntense[now* , runway (r v )] A Pres[open(r v )}. 
The constraint that each one of (3\ , . . . , f3 n must occur at least once within (ft rules out 
meaningless formulae like ?o v Past[manager(John)], where the o v does not have any 
relation to the rest of the formula. This constraint is similar to the notion of safety 



in Datalog jUllman 88| , and it is needed in the translation from Top to Tsql2 of 
chapter ||| 

The interrogative-maximal quantifier is similar, except that it reports only maximal 
periods. f m xi is intended to be used only with variables that denote periods, and this 
is why in the case of ? mx i, 0i is required to occur within (ft as the first argument of 
a Past, Perf, At, Before, After, or Ntense operator, or as the second argument of a 
Part operator (the semantics of these operators ensure that variables occurring at these 
potitions denote periods). Intuitively, ? mx iPi ?/?2 t(3 n (ft means "report all the maximal 
periods (3\, and all (32, ■ ■■ , /3 n , such that (ft". The interrogative-maximal quantifier 
is used in "When ... ?" questions, where we want the answer to contain only the 
maximal periods during which a situation held, not all the periods during which the 
situation held. If, for example, gate 2 was open from 9:00am to 11:00am and from 
3:00pm to 5:00pm, we want the answer to "When was gate 2 open?" to contain only 
the two maximal periods 9:00am to 11:00am and 3:00pm to 5:00pm; we do not want 
the answer to contain any subperiods of these two maximal periods (e.g. 9:30am to 
10:30am). To achieve this, the question is represented as ? mx ie v Past[e v , open(gate2)]. 



Formulae: FORMS is the set of all Top formulae. FORMS = YNFORMS U 
WHFORMS. 



3.3 The temporal ontology 



Point structure: A point structure for Top is an ordered pair (PTS, -<), such that 
PTS is a non-empty set, -< is a binary relation over PTS x PTS, and {PTS, -<) has 
the following five properties: 

transitivity: If ti,t2,*3 £ PTS, t\ -< t2, and t2 -< £3, then t\ -< £3. 
irreflexivity: If t G PTS, then t -< t does not hold. 
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linearity: If t\, ti G PTS and t\ 7^ t2, then exactly one of the following holds: t\ -< ti 
or ti<t\. 

left and right boundedness: There is a tf irst G PTS, such that for all t G PPS, 
i/irsi ^ £• Similarly, there is a i/ aS i G PTS, such that for all t G PTS, t X i; aS f. 

discreteness: For every £x>^2 £ PTS, with ti 7^ £2, there is at most a finite number 
of £3 G PTS, such that ti -< t$ ~< ti. 

Intuitively, a point structure {PTS, -<) for Top is a model of time. Top models time 



as being discrete, linear, bounded, and consisting of time-points (see [ vanBenthem 83 ] 
for other time models.) PTS is the set of all time-points, and p\ -< pi means that the 
time-point pi precedes the time-point pi. 



prev(t) and next(t): If t\ G PTS — {ti ast }, then next(t\) denotes a t% G PTS, 
such that t\ -< t2 and for no t% G PTS is it true that t\ -< t% -< ti- Similarly, if 
t\ G PTS — {tfi rs t}, then prev(ti) denotes a ti G PTS, such that ti -< t\ and for no 
t% G PTS is it true that ti -< £3 -< ix- In the rest of this thesis, whenever next(t) is 
used, it is assumed that i 7^ fyast- Similarly, whenever prev(t) is used, it is assumed 
that t / t/irsf. 



Periods and instantaneous periods: A period p over (PTS, -<) is a non-empty 
subset of PTS 1 with the following property: 

convexity: If t\,t2 G p, £3 G PTS, and ti -< £3 -< ti, then £3 G p. 

The term "interval" is often used in the literature instead of "period" . Unfortunately, 
Tsql2 uses "interval" to refer to a duration (see chapter |5|). To avoid confusing the 
reader when Tsql2 will be discussed, I follow the Tsql2 terminology and use the term 
"period" to refer to convex sets of time-points. 

PERIODS \pts,-q is the set of all periods over (PTS,-<). If p G PERIODS \ PT s,-<) 
and p contains only one time-point, then p is an instantaneous period over (PTS, -<). 
INSTANTS 7 pts, -<) is ^ ne se ^ °f an instantaneous periods over (PTS, -<). For sim- 
plicity, I often write PERIODS and INSTANTS instead of PERIODS < PT s,-<} and 
INSTANTS (pts > an d I often refer to simply "periods" and "instantaneous periods" 
instead of "periods over (PTS, -<)" and "instantaneous periods over (PTS, -<)". 
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PERIODS* {PTS ^ (or simply PERIODS*) is the set PERIODS U {0}, i.e. PERIODS* 
is the same as PERIODS , except that it also contains the empty set. (The reader is 
reminded that periods are non-empty sets.) 

Subperiods: p\ is a subperiod of p2, iff Pi,P2 G PERIODS and p\ Cp 2 . In this case 
I write pi C p 2 - (pi C P2 is weaker than p\ C p 2 , because it does not guarantee that 
VXiVi £ PERIODS .) Similarly, p\ is a proper subperiod of p2, iff Pi,f?2 G PERIODS 
and pi C P2 • In this case I write p\ C P2 • 

Maximal periods: If S 1 is a set of periods, then mxlpers(S) is the set of maximal 
periods of S. mxlpers(S) *= G S | for no G S is it true that p C p'}. 

minpt(S) and maxpt(S): If S C PTS, minpt(S) denotes the time-point i 6 5, 
such that for every t' G S, t ^ i'. Similarly, if 5 C PTS, maxpt(S) denotes the 
time-point t £ S, such that for every t' G S, t' ■< t. 

Notation: Following standard conventions, [ti , ^2] denotes the set {t G PTS \ t\ X 
t if? £2}- (This is not always a period. If £2 -< ti, then [ii, ^2] is the empty set, which is 
not a period.) Similarly, (fj.^] denotes the set {t G PTS | t\ -< t ^ £2}- [^1,^2) and 
(^i , ^2) are defined similarly. 

3.4 TOP model 

A Top model M is an ordered 7-tuple: 

M — ( (-P TiS , ) , OBJS , fcons > fpfuns > f culms i fgparts j fcparts ) 



such that (PTS,-<) is a point structure for Top (section |3.3| ), PERIODS (pts,^) ^ 
05 JS*, and / corts , / c «; ms , / 5?M rts, and / cparis are as specified below. 

OBJS: OBJS is a set containing all the objects in the modelled world that can 
be denoted by Top terms. For example, in the airport domain OBJS contains all 
the gates and runways of the airport, the inspectors, the flights, etc. The constraint 
PERIODS (pts ,~<) ^ OBJS ensures that all periods are treated as world objects. This 
simplifies the semantics of Top. 
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fcons : fcons is a function CONS i— > OBJS. (I use the notation D ^ R to refer to 
a function whose domain and range are D and R respectively.) f CO ns specifies which 
world object each constant denotes. In the airport domain, for example, f CO ns may map 
the constants gate2 and 6a737 to some gate of the airport and some flight respectively. 

fpfuns= fpfuns is a function that maps each pair (Tv,n), where ir G PFUNS and n G 
{1,2,3, . . . }, to another function (OBJS) n h-> pow(PERIODS). (pow(S) denotes the 
powerset of S, i.e. the set of all subsets of S. (OBJS) n is the n-ary cartesian product 
OBJS x OBJS x • • • x OBJS.) That is, for every vr G PFUNS and each ra € {1, 2, 3, . . . }, 
fpfuns (iti n ) is a function that maps each n-tuple of elements of OBJS to a set of periods 
(an element of pow (PERIODS)). 

Intuitively, if t\,T2, . . . ,r n are Top terms denoting the world objects oi, 02, . . . , o n , 
fpfuns(ft, n)(oi, 02, . . . , o n ) is the set of the maximal periods throughout which the situ- 
ation described by 7t(ti, T2, . . . , r n ) is true. For example, if the constant 6a737 denotes 
a flight-object 01, gate2 denotes a gate-object 02, and be.at(ba737, gate2) describes the 
situation whereby the flight 01 is located at the gate 02, then f p f uns (be-at, 2)(oi, 02) 
will be the set that contains all the maximal periods throughout which the flight 01 is 
located at the gate 02. 

For every 7r G PFUNS and n G {1,2,3,...}, / p /u ns (7r, n) must have the following 
property: for every (01,02, . . . , o n ) G (OBJS) n , it must be the case that: 

ifpi,P2 G fpfuns (n,n) (oi,o 2 , ... ,o n ) andpiUp 2 G PERIODS, then pi = p 2 

This ensures that no two different periods pi,P2 in fpfuns(, 7 r, n )(°ii ■ ■ ■ >°n) overlap or 
are adjacent (because if they overlap or they are adjacent, then their union is also a 
period, and then it must be true that p\ = P2). Intuitively, if p\ and P2 overlap or are 
adjacent, we want f p funs(^i n){p\, 02, ■ ■ ■ , o n ) to contain their union p\ U P2 instead of 
pi and p 2 . 

fculms : f culms is a function that maps each pair (ir, n), where ir G PFUNS and n G 
{1, 2, 3, . . . }, to another function (OBJS) 11 1— > {T, F} (T, F are the two truth values). 
That is, for every ir G PFUNS and each n G {1, 2, 3, . . . }, / M j ms (ir, n) is a function that 
maps each n-tuple of elements of OBJS to T or F. 

f culms is only consulted in the case of predicates that represent actions or changes 
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that have inherent climaxes. If 7t(ti,t 2 , . . . ,r n ) represents such an action or change, 
and ti,t 2 , . . . ,r n denote the world objects o\, o 2 , . . . , o n , then / p / uns (7r, n)(o\, o 2 , . . . ,o n ) 
is the set of all maximal periods throughout which the action or change is ongoing. 
f culms (ft, n){o\, o 2 , . . . , o n ) shows whether or not the change or action reaches its climax 
at the latest time-point at which the change or action is ongoing. For example, if 
the constant j_adams denotes a person o\ in the world, bridgel denotes an object 
o 2 , and building(j_adams,ba737) describes the situation whereby o\ is building o 2 , 
fpfuns(building, 2)(oi, 02) will be the set of all maximal periods where 01 is building 02. 
f culms (building, 2) (01,02) will be T if the building is completed at the end-point of the 
latest maximal period in f p f uns (building, 2)(oi, o 2 ), and F otherwise. The role of f cu i ms 
will become clearer in section |3.9| . 

fgparts : fgparts is a function that maps each element of GPARTS to a gappy partition- 
ing. A gappy partitioning is a subset S of PERIODS , such that for every p\,p2 G S, 
Pi H P2 = 0, and UpesP 7^ PTS. For example, fg P arts(monday g ) could be the gappy 
partitioning of all Monday-periods. 

f*cparts : fcparts is a function that maps each element of CPARTS to a complete par- 
titioning. A complete partitioning is a subset S of PERIODS , such that for every 
P\,P2 S 5, pi n K = 0> and UpeS^ = PTS. For example, f C parts(day c ) could be the 
complete partitioning of all day-periods. 

3.5 Variable assignment 

A variable assignment with respect to (w.r.t.) a Top model M is a function g : 
VARS 1 — ^ OBJS (g assigns to each variable an element of OBJS). Gm, or simply G, 
is the set of all possible variable assignments w.r.t. M, i.e. G is the set of all functions 
VARS ^ OBJS. 

If g G G, j3 € K4i?S", and o G OBJS, then is the variable assignment defined as 
follows: gg(J3) = o, and for every 0' G with /?' / /3, c^(/3') = g(J3). 
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3.6 Denotation of a TOP expression 
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Index of evaluation: An index of evaluation is an ordered 3-tuple (st,et,lt), such 
that st G PTS, et G PERIODS, and It G PERIODS*. 

st (speech time) is the time-point at which the English question is submitted to the 
Nlitdb. et (event time) is a period where the situation described by a Top expression 
takes place. It (localisation time) can be thought of as a temporal window, within 
which et must be located. When computing the denotation of a Top formula that 
corresponds to an English question, It is initially set to PTS. That is, the temporal 
window covers the whole time-axis, and et is allowed to be located anywhere. Various 
operators, however, may narrow down It, imposing constraints on where et can be 
placed. 

Denotation w.r.t. M, st, et, It, g: The denotation of a Top expression £ w.r.t. 
a model M, an index of evaluation (st,et,lt), and a variable assignment g, is written 
|| £ || M,st,et,lt,g or s i m piy ||£|| st,et,Jt,p_ When the denotation of £ does not depend on st, 

et, and It, I often write ||£|| M ' 9 or simply ||f ||». 

The denotations w.r.t. M, st, et, It, g of Top expressions are defined recursively, starting 
with the denotations of terms and atomic formulae which are defined below. 

• If kg CONS, then \\k\\9 = f cons (n). 

• If G VARS, then \\(3\\9 = g(J3). 

• If <p G YNFORMS, then \\(f)\\^,et,lt,g £ {T,F}. 

The general rule above means that in the case of yes/no formulae, we only need to 
define when the denotation is T. In all other cases the denotation is F. 

• If <£i,02 G YNFORMS, then \\cpi A 2 ||^t,Jt, fl = T iff ||0i || s *> e *>^ = T and 

||0 2 ||st,et,;t,g _ 

• If <7 G PARTS, (3 G tfAAS', andi/ ord G {...,-3,-2,-1,0}, then || Part [a, (3, v ord ] ||» 
is T, iff all the following hold (below / = f cpa rts if f G CPARTS, and / = f gpa Hs 
if a G GPARTS): 
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- 9(P) 6 f(a), 

- if v ord = 0, then st G #(/3), 

- if f orf j < —1, then the following set contains exactly —v rd — 1 elements: 

{p G f(a) | maxpt(g((3)) -< minpt(p) and maxpt(p) ~< st} 

Intuitively, if v or d = 0, then [3 must denote a period in the partitioning that contains st. 
If Vara < —1, (3 must denote the — f or ^-th period of the partitioning that is completely 
situated before the speech time (e.g. if v or d = —4, (3 must denote the 4th period 
which is completely situated before st); that is, there must be —v rd ~ 1 periods in the 
partitioning that fall completely between the end of the period denoted by (5 and st 
(—(—4) — 1 = 3 periods if f or d = —4). 

For example, if f cpa rts (day c ) is the partitioning of all day-periods, then || Part [day c , f3,0]\\ 9 
is T iff g(/3) covers exactly the whole current day. Similarly, \\Part [day c , (3, — 1}\\ 9 is T 
iff g(/3) covers exactly the whole previous day. (Part [day c , f3, 0] and Part[day c , (3, — 1] 
can be used to represent the meanings of "today" and "yesterday"; see section |3.10[ ) 
The definition of Part could be extended to allow positive values as its third argument. 
This would allow expressing "tomorrow", "next January", etc. 



• If a € PARTS and (3 G VARS, then || Part [a, [3] \\9 = T iff g(/3) G f(a) (where 
/ = fcparts if ° G CP ARTS, and / = f gparts if a G GPARTS). 

Part[a,(3] is a simplified version of Parser, (3, v or d\, used when we want to ensure that 
g((3) is simply a period in the partitioning of a. 

• If 7T G PFfWS and n, r 2 , . . . , r n G TERMS, then [|ir(ri, r 2 , . . . , r n )|| st ' e *'"'f is T 
iff C Zi and for some p mxi G f p f U ns(^, »)(||ti II 9 , ll^ll 9 , ■ • • , lkn|| 9 ), et C p mxZ . 



Intuitively, for the denotation of a predicate to be T, ei must fall within It, and ei must 
be a subperiod of a maximal period where the situation described by the predicate 
holds. It is trivial to prove that the definition above causes all Top predicates to have 
the following property: 
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Homogeneity: A Top formula <fi is homogeneous, iff for every st G PTS, et G 
PERIODS, It E PERIODS*, and g £ G, the following implication holdsfl 

if et' C et and ||</>|| s *> et ^s = T, then \\<f,\\^^a = T 

Intuitively, if a predicate is true at some et, then it is also true at any subperiod et' 
of et. Although Top predicates are homogeneous, more complex formulae are not 



always homogeneous. Various versions of homogeneity have been used in | Allen 



| |Lascarides 88 ], [ Richards et al. 89[ , [Kent 93 1, [Pratt & Bree 95], and elsewhere. 



The denotation of a wh-formula w.r.t. st, et, It, and g is defined below. It is assumed 
that 0i, 2 , 3 , . . . , fa G VARS and <p G YNFORMS. 

• ]]?/?! ?/? 2 ...?/?„ <j)\\ st ^9 = {(#(/?i), 5 (/3 2 ),..., 9 (/? n )) | \\<f>\\^t>9=T} 

That is, if ||«/,|| s *> e *A<? = T, then \\?p 1 ?/3 2 ... ?/?„ <j)\\ st > et > lt '9 is a one-element set; it con- 
tains one tuple that holds the world-objects assigned to 0±, 02, ■ ■ ■ , n by g. Otherwise, 
\\10i ?/% ... ?/3 n 0|| s *' et '"' 9 is the empty set. 



||?m^l ?& ?/% ...?Pn4>\\ St ' et ' H ' 9 = 

{(#),#),#3),..,S(«) I [|0|| st ' e ^ = T, and 
for no et' G PERIODS and g' £ G is it true that = T, 

5(A) c g'(Pi), g(Jh) = g'(fo), g(0s) = g'(0s), g{0 n ) = g'(0n)} 



The denotation \\? mx i0i 102 ?/% ••• ?/?n </)|| st > et ' Z *> 9 is either a one-element set that 
contains a tuple holding the world-objects g(0i), g(02), ■ ■ ■ ,g{0n), or the empty set. 
Intuitively, the denotation of ? mx l01 ?/?2 ?/% • • • ?/?n contains the values assigned 
to 0i, Pi-, 03, • • • , Pn by g, if these values satisfy 4>, and there is no other variable as- 
signment g' that assigns the same values to 02, 03, ■ ■ ■ , n , a superperiod of g{0\) to 
01, and that satisfies <f> (for any et' G PERIODS). That is, it must not be possible 
to extend any further the period assigned to 0\ by g, preserving at the same time 
the values assigned to 02,03, ■ ■ ■ ,0n, an d satisfying <j>. Otherwise, the denotation of 
?mxl0i 702 7 -03 ■■■ 10 n 4> is the empty set. 



The syntax of Top (section |3.2| ) requires 0\ to appear at least once within (f> as the 
first argument of a Past, Perf, At, Before, After, or Ntense operator, or as the second 
The term "homogeneit y" is also used i n the temporal databases literature, but with a completely 



different meaning; see [Jensen et al. 93 1 
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argument of a Part operator. The semantics of these operators require variables oc- 
curring at these positions to denote periods. Hence, variable assignments g that do not 
assign a period to ft will never satisfy (f>, and no tuples for these variable assignments 
will be included in \\lmxlPl ?fo ?ft • • • ?A» 0|| s *- et >^. 

The rules for computing the denotations w.r.t. M, st, et, It, g of other Top expressions 
will be given in following sections. 

Denotation w.r.t. M, st: I now define the denotation of a Top expression with 
respect to only M and st. The denotation w.r.t. M, st is similar to the denotation w.r.t. 
M, st, et, It, g, except that there is an implicit existential quantification over all g G G 
and all et G PERIODS , and It is set to PTS (the whole time-axis). The denotation of 
<f) w.r.t. M,st, written ||(/>|| M ' si or simply ||</>||' st , is defined only for TOP formulae: 

• If (/) G YNFORMS, then ||c/>|| st = 

- T, if for some g G G and et G PERIODS, W^ft^PTS ,g _ ^ 

— F, otherwise 

. If € WHFORMS, then = U geG , ^PERIODS U\\ st > et ' PTS '° . 

Each question will be mapped to a Top formula (if the question is ambiguous, 
multiple formulae will be generated, one for each reading). \\4>\\ st specifies what the 
Nlitdb's answer should report. When <fi G YNFORMS, \\4>\\ st = T (i.e. the answer 
should be "yes") if for some assignment to the variables of (ft and for some event time, (j) 
is satisfied; otherwise \\<J)\\ st = F (the answer should be "no"). The localisation time is 
set to PTS (the whole time-axis) to reflect the fact that initially there is no restriction 
on where et may be located. As mentioned in section |3.6| , however, when computing 
the denotations of the subformulae of <j>, temporal operators may narrow down the 
localisation time, placing restrictions on et. 

In the case where G WHFORMS (i.e <p =?ft ... ?/?„ <f/ or <j) =l m xifi\ ■■■ ?/?n <P' 
with cj)' G YNFORMS), is the union of all \\(p\\ st > et ' PTS '° , for any g G G and 

et G PERIODS. For each g G G and et G PERIODS, \\(j)ft,et,PTS ,g ig either an empty 
set or a one-element set containing a tuple that holds values of /?i , (3% , , . . . , (3 n that 
satisfy <f)' (ft must be maximal if G WHFORMS 2 ). Hence, (the union of all 

\\<fi\\ st > et ' PTS '9) i s the set of all tuples that hold values of ft, ft, ft, • • • , ft that satisfy 
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Figure 3.1: "Is BA737 at gate 2?" 



(j)'. The answer should report these tuples to the user (or be a message like "No answer 
found.", if ||(/>|| st = 0). 

3.7 The Pres operator 

The Pres operator is used to express the simple present and present continuous tenses. 
For 4> £ YN FORMS: 

• \\Pres\(h}\\ st > et > lt ' 9 = T, iff st G et and \U\\ st ^9 = T _ 



for example, is represented as (| 

(3.3) Is BA737 at gate 2? 

(3.4) Pres [be.at (6a737, gate2)] 



Let us assume that the only maximal periods where BA737 was/is/will be at gate 2 



are Pmxh and p mx i 2 (i-e. (3.5) holds; see section 3^), and that (|3.3j ) is submitted at a 



time-point sti, such that (3.6) holds (figure 3.1 ) 



(3.5) f pfuns (be.at,2)(f cons (ba737)J cons (gate2)) = {p mxh , p m xi 2 } 

(3.6) Stl £ Pmxh 



The answer to (gj) will be affirmative iff (j3^) is T 



(3.7) \\Pres[be-at{ba737,gate2)]\\ stl 

According to section |y| (|^) is T iff for some g e G and et G PERIODS, holds. 

(3.8) ||Pres[6e_at(6a737, 5 ate2)]|| stl ' ei ' PT5 ' 5 = T 
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By the definition of Pres, (jO) holds iff both (3.9) and (3.10) hold. 



(3.9) sh € et 

(3.10) \\be-at(ba737,gate2)\\ stl > et > PTS >9 = T 

By the definitions of ||tt(ti, . . . , Tn )\\*t,et,it,g an d || K ||<? ( sec ti n gj), (3.10) holds iff for 
some p mx u (3.11) - (3.13) hold. 

(3.11) et C PTS 

(3.12) p mx i G f pfuns (be.at, 2) (f cons (ba737) , f cons (gate2)) 

(3.13) et C 

By (3.5), (3.12) is equivalent to (3.14). 

(3.14) p mx i e {PmxhiPmxh) 



The answer to ( [3.3D will be affirmative iff for some et G PERIODS and some p ma; 2, 
(3.9), (3.11), (3.13), and (3.14) hold. For p mxi = p mxh , and et any subperiod of p mxh 
that contains sti (figure |Q|), (3.9), (3.11), (3.13), and (3.14) hold. Hence, the answer 
to (|3.3| ) will be affirmative, as one would expect. If the question is submitted at an 
st2 that falls outside p m xh an d Pmxi 2 (figure |3.1| ), then the answer will be negative, 
because in that case there is no subperiod et of p m xh or p mx i 2 that contains st2- 

The present continuous is expressed similarly. For example, the reading of fl3.15|) where 
Airserve is actually servicing BA737 at the present moment is expressed as ( |3.16| ). Un- 
like [ Dowty 77| |, [ Lascarides 88 1, [ Pirie et al. 90 ], and [ Crouch fc Pulman 93| ], in Top 



progressive tenses do not introduce any special progressive operator. This will be 



discussed in section 3.£. 



(3.15) Airserve is (actually) servicing BA737. 

(3.16) Pres[servicing(air serve, 6a737)] 

The habitual ( p.!7| ) is represented using a different predicate functor from that of 

( 3.15| ), as in ( 3.18j ), As will be explained in chapter |4], (3.15) is taken to involve a non- 



habitual homonym of "to service", while ( |3.17 ) is taken to involve a habitual homonym. 



The two homonyms introduce different predicate functors. 



(3.17) Airserve (habitually) services BA737. 
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(3.18) Pres[hab_server_of (air serve, ba737)] 



90 



Top's Pres operator is similar to that of [Pirie et al. 90]. The main difference is that 



the Pres of Pirie et al. does not require st to fall within et. Instead, it narrows It to 
start at or after st. This, in combination with the requirement et C It, requires et to 



start at or after st. Using this version of Pres in (3.4) would cause the answer to be 



affirmative if (3.3) is submitted at st2 (figure 3T), i.e. at a point where BA737 is not 
at gate 2, because there is an et at which BA737 is at gate 2 (e.g. the et of figure 3.1), 
and this et starts after st2- This version of Pres was adopted by Pirie et al. to cope 
with sentences like "J.Adams inspects BA737 tomorrow." , where the simple present 
refers to a future inspection (section |2.5.l| ). In this case, et (inspection time) must be 
allowed to start after st. 

The Pres of Pirie et al. is often over-permissive (e.g. it causes the answer to be affir- 
mative if (|3.3| ) is submitted at st?)- Pirie et al. employ a post-processing mechanism, 
which is invoked after the English sentence is translated into logic, and which attempts 
to restrict the semantics of Pres in cases where it is over-permissive. In effect, this 
mechanism introduces modifications in only one case: if the Pres is introduced by a 
state verb (excluding progressive states) which is not modified by a temporal adverbial, 
then et is set to {st}. For example, in "J.Adams is at site 2." where the verb is a state, 
the mechanism causes et to be set to {st}, which correctly requires J.Adams to be at 
gate 2 at st. In "J.Adams is at site 2 tomorrow." , where the state verb is modified by a 
temporal adverbial, the post-processing has no effect, and et (the time where J.Adams 
is at site 2) is allowed to start at or after st. This is again correct, since in this case 
et must be located within the following day, i.e. after st. In "J.Adams is inspecting 
site 2", where the verb is a progressive state, the post-processing has again no effect, 
and et (inspection time) can start at or after st. The rationale in this case is that et 
cannot be set to {st}, because there is a reading where the present continuous refers to 
a future inspection (section |2.5.3| ). For the purposes of this project, where the futurate 
readings of the simple present and the present continuous are ignored, Top's Pres is 
adequate. If, however, these futurate readings were to be supported, a more permissive 
Pres operator, like that of Pirie et al., might have to be adopted. 
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3.8 The Past operator 
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The Past operator is used when expressing the simple past, the past continuous, the 
past perfect, and the present perfect (the latter is treated as equivalent to the simple 



past; section |X|). For 6 YN FORMS and G VARS: 



\Past[P,(f>]\\ st ' et ' H 'S = T, iff g(P) = et and \\^\\st,et,ltn{t SirsU st),g = T 



The Past operator narrows the localisation time, so that the latter ends before st. et 
will eventually be required to be a subperiod of the localisation time (this requirement 
will be introduced by the rules that compute the denotation of <p). Hence, et will be 
required to end before st. (3 is used as a pointer to et (the definition of Past[/3,(f)] 
makes sure that the value of (5 is et). (3 is useful in formulae that contain Ntenses 



(to be discussed in section 3,13 ). It is also useful in time-asking questions, where 
et has to be reported. For example, "When was gate 2 open?" is represented as 
t mx ie v Past[e v , open (gat e2)], which reports the maximal ets that end before st, such 
that gate 2 is open throughout et. 

Top's Past operator is essentially the same as that of | Pirie et al. 90| ] . (A slightly 
different Past operator is adopted in |Crouch fc Pulman 93|1.) 



3.9 Progressives, non-progressives, and the Culm opera- 
tor 



Let us now examine in more detail how Top represents the simple past and the past 
continuous. Let us start from verbs whose base forms are culminating activities, like 
"to inspect" in the airport domain. The past continuous ( 3.19 ) is represented as ( 3.20| ). 



(3.19) Was J.Adams inspecting BA737? 

(3.20) Past[e v , inspecting (j _adams , 6a737)] 



Let us assume that the inspection of BA737 by J.Adams started at the beginning of 
Pmxh (figure [3^2] ) , that it stopped temporarily at the end of Pmxln that it was resumed 
at the beginning of p m xi 2 > an d that it was completed at the end of p mx i 2 . Let us also 
assume that there is no other time at which J.Adams was/is/will be inspecting BA737. 
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2 I inspection completed 

j inspecting ^ ^ inspecting ^ 

^mxll ^mxl2 

i h— i i i 

st 3 ei i St 2 Stj 
Figure 3.2: "Was J.Adams inspecting BA737?" vs. "Did J.Adams inspect BA737?" 

Then, (3.21) and (3.22) hold. 
(3.21) 

fpfuns (inspecting, 2) (f cons (j.adams) , f cons (ba737) ) = {p™^ , p mx i 2 } 
(3.22) f cu i ms (inspecting 1 2)(f cons (j_adams),f cons (ba737)) = T 



The reader can check that ( |3.23D is T iff there is an et that is a subperiod of p m xh or 



Vmxhi an£ i that ends before si. 

(3.23) \\Past[e v ,inspecting(j-adams, 6a737)]|| s * 

If (3.19) is submitted at st\ or st2 (figure 3.2), then ( |3.23 ) is T (the answer to (3.19) 



will be "yes"), because in both cases there is an et (e.g. the et\ of figure |3.2|) that ends 
before st\ and st2, and that is a subperiod of p m xh- in contrast, if the question is 
submitted at 5*3, (|3.23| ) is F (the answer will be negative), because in this case there 
is no subperiod of p m xh or p mx i 2 that ends before st^. This is what one would expect: 
at st± and st2 the answer to ( 3.19| ) should be affirmative, because J.Adams has already 



spent some time inspecting BA737. In contrast, at st^ J.Adams has not yet spent any 
time inspecting BA737, and the answer should be negative. 

Let us now consider the simple past ( |3.24j ). We want the answer to be affirmative if 
( 3.24| ) is submitted at sii ( or cLny other time-point cifter the end of Pmxi2 

), but not if 



it is submitted at st2 (or any other time-point before the end of p m xl 2 )i because at st2 
J.Adams has not yet completed the inspection (section |2.5.2| ). 

(3.24) Did J.Adams inspect BA737? 



( 3.24| ) cannot be represented as ( p.20 ), because this would cause the answer to ( 3.24[ ) 



to be affirmative if the question is submitted at st2 ■ Instead, ( |3.24| ) is represented 
as ( p. 25 ). The same predicate inspecting(j_adams,ba737) of ( |3.20| ) is used, but an 



additional Culm operator is inserted. 

(3.25) Past[e v , Culm[inspecting(j_adams,ba737)]] 
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Intuitively, the Culm requires the event time to be the et2 of figure 3.2, i.e. to cover the 
whole time from the point where the inspection starts to the point where the inspection 
is completed. (If the inspection is never completed, Culm causes the denotation of 
( |3.25| ) to be F.) Combined with the Past operator, the Culm causes the answer to be 
affirmative if (|3.24[) is submitted at st\ (because et2 ends before st\), and negative if 
the question is submitted at st2 (because et2 does not end before s^)- More formally, 
for 7T G PFUNS and n, ... , r„ G TERMS: 

. \\Gulmfr(T 1 ,...,T n )]\\' t <* lt * = T, iff et C It, / cute (7r,n)(||rip, . . . , || Tn ||*) = T, 
S ^ 0, and et = [minpt(S),maxpt(S)], where: 

S= U P 

P£fpfiins{Tr,n)(\\Tl\\9,...,\\T n \\ g ) 

The et = [minpt(S) , maxpt(S)] requires et to start at the first time-point where the 
change or action of vr(ri, . . . , r n ) is ongoing, and to end at the latest time-point where 
the change or action is ongoing. The / c «; m s( 7r )(||' r i \\ 9 , ■■ ■ , ll r n|| 9 ) = T means that the 
change or action must reach its climax at the latest time-point where it is ongoing. 



Let us now check formally that the denotation ( |3.26| ) of ( |3.25| ) is in order. 
(3.26) ||Pasi[e' y , Culm[inspecting(j _adams, ba737)]]\\ st 

(HH) is T iff for some g G G and et G PERIODS, ( gjgg) holds. 



(3.27) \\Past[e v , Culm[inspecting(j_adams, ba737)]]\\ st ' et ' PTS ' 9 = T 

By the definition of Past, (ET27D holds iff (3.28) and (3.29) hold (PTS n [tf irst ,st) 



[t first, St)). 

(3.28) g(e v ) = et 

(3.29) || Culm[inspecting(j_adams, ba737)]\\ st ' et ' [t f' rst ' st) ' 9 = 

By the definition of Culm, (3.29) holds iff (3.30) - (3.34) hold. 

(3.30) et C [t fi rs t, st) 

(3.31) f cu i ms (inspecting, 2) (f cons (j_adams),f cons (ba737)) = 

(3.32) S^(D 

(3.33) et = [minpt(S), maxpt(S)] 

(3.34) S= |J p 

P&fpfuns (inspecting, 2) (/ co „ s (j .adams) jams (6a737)) 
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By (3.21), and assuming that maxpt(p mx i 1 ) -< minpt(p mx i 2 ) (as in figure f3^ ), (3.32) - 

(3.34) are equivalent to (3.35) - (3.36). (3.35) holds (the union of two periods is never 
the empty set), and (3.31) is the same as (3.22), which was assumed to hold. 

(3.35) Pmxh u Pmxl 2 / 

(3.36) et = [minpt(p mxll ) , maxpt(p mx i 2 )] 



Hence, fl3~2S; ) is T (i.e. the answer to ( |3.24[) is affirmative) iff for some g & G and 



et £ PTS, (3.28), (3.30), and (3.36) hold. Let eti = [minpt(p mx i 1 ),maxpt(p mx i 2 )] (as 
in figure 



Let us assume that ( |3.24 ) is submitted at an st that follows the end of et 2 (e.g. st\ 



in figure 3^). For et = etz, (3.30) and (3.36) are satisfied. (3.28) is also satisfied by 
choosing g = g\, where g\ as below. Hence, the answer to fl3.24j ) will be affirmative, as 
required. 



et 2 HP = e v 

o otherwise (o is an arbitrary element of OBJS) 



In contrast, if the question is submitted before the end of et 2 (e.g. st2 or st^ in figure 
3.2), then the answer to ( |3.24| ) will be negative, because there is no et that satisfies 



(3.30) and (3.36). 

In the case of verbs whose base forms are processes, states, or points, the simple past 
does not introduce a Culm operator. In this case, when both the simple past and the 
past continuous are possible, they are represented using the same Top formula. (A 
similar approach is adopted in parsons 89 1.) For example, in the airport domain where 



"to circle" is classified as process, both ( |3.37D and ( 3.38 ) are represented as ( |3,39| ). 



(3.37) Was BA737 circling? 

(3.38) Did BA737 circle? 

(3.39) Past[e v ,circling(ba737)} 



The reader can check that the denotation of ( [3.39 ) w.r.t. st is T (i.e. the answer to (|3T37|) 



and ( |3.38| ) is affirmative) iff there is an et which is a subperiod of a maximal period 
where BA737 was circling, and et ends before st. That is, the answer is affirmative iff 
BA737 was circling at some time before st. There is no requirement that any climax 
must have been reached. 
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The reader will have noticed that in the case of verbs whose base forms are culminating 
activities, the (non-progressive) simple past is represented by adding a Culm operator 
to the expression that represents the (progressive) past continuous. For example, 
assuming that "to build (something)" is a culminating activity, ( |3.40| ) is represented 
as ( g!4ll) , and (ET421 ) as fljlj) ). 



(3.40) Housecorp was building bridge 2. 

(3.41) Past[e v , building(housecorp, bridge2)] 

(3.42) Housecorp built bridge 2. 

(3.43) Past[e v , Culm[building(housecorp,bridge2)]] 



In contrast, in Dowty 77 1, | Lascarides 88 ], [ Pirie e£ a/. 90(| , | Crouch fe Pulman 



and [ Kamp fc Reyle 9j| , progressive tenses are represented by adding a progressive 



operator to the expressions that represent the non-progressive tenses. For example, 
ignoring some details, Pirie et al. represent flOOp and (|0|) as flO|) and (|3~45| ) 
respectively. 

(3.44) Pas^e^, Prog[build(housecorp, bridge2)]] 

(3.45) Pas^e^, build(housecorp, bridge2)] 



In ( |3.45 ), the semantics that Pirie et al. assign to build(housecorp,bridge2) require et 



to cover the whole building of the bridge by Housecorp, from its beginning to the point 
where the building is complete. (The semantics of Top's building(housecorp, bridge2) 
in ( 3.41[ ) require et to be simply a period throughout which Housecorp is building 



bridge 2.) The Past of ( 3.45| ) requires et (start to completion of inspection) to end 



before st. Hence, the answer to ( p.42 ) is affirmative iff the building was completed 
before st. 



In ( 3.44 ), the semantics that Pirie et al. assign to Prog require et to be a subperiod of 
another period et' that covers the whole building (from start to completion; figure |3,3D . 
The Past of (|]fj) requires et to end before st. If, for example, (|3.40| ) is submitted 



at an st that falls between the end of et and the end of et' (figure |3.3|) , the answer 
will be affirmative. This is correct, because at that st Housecorp has already been 
building the bridge for some time (although the bridge is not yet complete) . The Prog 
of Pirie et al., however, has a flaw (acknowledged in [ Crouch fc Pulman 93(| ): ( 3.44 ) 



implies that there is a period et', such that the building is completed at the end of 
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1 el 1 

I — building complete 

I eT 1 



1 building started I 

St 

Figure 3.3: A flawed Prog operator 



et'; i.e. according to fl3.44f) the building was or will be necessarily completed at some 
time-point. This does not capture correctly the semantics of ( |3,40 ). ( 3.40| ) carries no 
implication that the building was or will ever be completed. (Top's representation of 
( 3.40| ), i.e. ( |3,41 ), does not suffer from this problem: it contains no assumption that 
the building is ever completed.) To overcome similar problems with Prog operators, 
"branching" models of time or "possible worlds" have been employed (see, for example, 
| Dowty 77 ] , [ McDermott 82 ] , [ Mays 8€ ] , [ Kent 93f| ; see also | Lascaridcs 88 ] for criticism 



of possible- worlds approaches to progressives.) Approaches based on branching time 
and possible worlds, however, seem unnecessarily complicated for the purposes of this 
thesis. 

3.10 The At, Before, and After operators 



The At, Before, and After operators are used to express punctual adverbials, period 
adverbials, and "while ... ", "before ... ", and "after ... " subordinate clauses (sec- 
tions |J and ETToD . For <p, fa, fat YNFORMS and r G TERMS: 



\At\T,6\\\ st > et > lt >9 = T, iff llrll 5 G PERIODS and \U\\ st ' et ' ltn ^ 9 ^ = T 



\\At[fa,fa]\\ st > et ' lt '9 = T, iff for some et' 

et' £ mxlpers({e £ PERIODS | \\fa || si > e > PTO >9 = T}) and ||^ 2 ||«f,Bt,ltnef , fl _ T , 



If the first argument of At is a term r, then r must denote a period. The localisation 
time is narrowed to the intersection of the original It with the period of r. If the 
first argument of At is a formula (p\, the localisation time of fa is narrowed to the 
intersection of the original It with a maximal event time period et' at which (pi holds. 



For example, (3.46) is represented as (3.47). 



(3.46) Was tank 2 empty (some time) on 25/9/95? 
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In ( p.47| ), It initially covers the whole time-axis. The At operator causes It to become 
the 25/9/95 period (I assume that the constant 25/9/95 denotes the obvious period), 
and the Past operator narrows It to end before st (if 25/9/95 is entirely in the past, 
the Past operator has not effect). The answer to fl3.46|) is affirmative iff it is possible 
to find an et that is a subperiod of the narrowed It, such that tank 2 was empty during 
et. 



If (|3.46f) is submitted before 25/9/95 (i.e. 25/9/95 starts after st), the Nlitdb's answer 



will be negative, because the At and Past operators cause It to become the empty set, 
and hence it is impossible to find a subperiod et of It where tank 2 is empty. A simple 
negative response is unsatisfactory in this case: ( [3.46 ) is unacceptable if uttered before 



25/9/95, and the system should warn the user about this. The unaccept ability of ( 3.46| ) 



in this case seems related to the unacceptability of ( 3.48 ), which would be represented 
as fl3.49|) (the definition of Part would have to be extended to allow positive values of 
its third argument; see section |3.6| ). 



(3.48) *Was tank 2 empty tomorrow? 

(3.49) Part[day c , torn", 1] A At[tom v , Past[e v , empty (tank2)]\ 

In both cases, the combination of the simple past and the adverbial causes It to become 
the empty set. In ( |3.48| ), torn" denotes the period that covers exactly the day after 
st. The At and Past operators set It to the intersection of that period with [tfi rs t, st). 
The two periods do not overlap, and hence It = 0, and it is impossible to find a 
subperiod et of It. This causes the answer to be always negative, no matter what 
happens in the world (i.e. regardless of when tank 2 is empty). Perhaps the questions 
sound unacceptable because people, using a concept similar to Top's It, realise that the 
answers can never be affirmative. This suggests that the Nlitdb should check if It = 0, 



and if this is the case, generate a cooperative response (section 1.4) explaining that the 



question is problematic (this is similar to the "overlap rule" of [ [Harper fe Charniak 86 



and the "non-triviality constraint" on p. 653 of [Kamp & Reyle 93]). The framework 



of this thesis currently provides no such mechanism. 

Moving to further examples, (|3.50D and (3.53) are represented as (3.52) and (3.54). 



Unlike the "on 25/9/95" of ( 3.46[ ), which is represented using a constant (25/9/95) 
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the "on Monday" of (3.50) is represented using a variable {mori") that ranges over the 



periods of the partitioning of Monday-periods. Similarly, the "at 5:00pm" of ( 3.53j ) is 
represented using a variable (fv v ) that ranges over the 5:00pm minute-periods. 

(3.50) Was tank 2 empty on Monday? 

(3.51) Was tank 2 empty on a Monday? 

(3.52) Part[monday 9 , mon v ] A At[mon v , Past[e v , empty (tank2)]] 

(3.53) Was tank 2 empty on Monday at 5:00pm? 

(3.54) Part[monday 9 ,mon v ] A Part[5:00pm g , fv v ] A 
At[mon v , At[fv v , Past[e v , empty(tanfc2)]]] 



( 3.52j ) requires tank 2 to have been empty at some past et that falls within some 



Monday. No attempt is made to determine exactly which Monday the user has in 
mind in ( 3.50| ) (( |3.50 ) is treated as equivalent to ( 3.51[ ); section |2.12| ). Similarly, (3.54) 



requires tank 2 to have been empty at some past et that falls within the intersection 
of some 5:00pm-period with some Monday-period. 

Assuming that "to inspect" is a culminating activity (as in the airport application), the 



reading of ( 3.55 ) that requires the inspection to have both started and been completed 



within the previous day (section [2.9.2 ) is represented as ( |3.56|) . The Culm requires et 



to cover exactly the whole inspection, from its beginning to its completion. The Past 
requires et to end before st, and the At requires et to fall within the day before st. 

(3.55) Did J.Adams inspect BA737 yesterday? 

(3.56) Part[day c , y v , —1] A At[y v , Past[e v , Culm[inspecting(j_adams, 6a737)]]] 



In contrast, ( 3.57| ) is represented as ( 3.58 ). In this case, et must be simply a subperiod 



of a maximal period where J.Adams was inspecting BA737, and also be located within 
the previous day. 

(3.57) Was J.Adams inspecting BA737 yesterday? 

(3.58) Part[day c , y v , —1] A At[y v , Past[e v ,inspecting(j-adams, 6a737)]] 



Finally, ( 3.59 ) is represented as ( 3.60 ), which intuitively requires BA737 to have been 
circling at some past period e2 v , that falls within some past maximal period el v where 
gate 2 was open. 



CHAPTER 3. THE TOP LANGUAGE 

(3.59) Did BA737 circle while gate 2 was open? 

(3.60) At[Past[er,open(gate2)],Past[e2 v ,circling(ba737)}} 



99 



The Before and After operators are similar. They are used to express adverbials and 
subordinate clauses introduced by "before" and "after". For (f>, fa, fa G YNFORMS 
and r E TERMS: 



• || Before [t, = T, iff ||r|| 9 E PERIODS and 

ist,et,Wn[t/i rst ,minpt(||T||»)),p _ y_ 



\\Before[fa,fa]\\ st ' et > lt '9 = T, iff for some et' 

et' G mxZpers({e G PERIODS \ \\fa\\ st > e ' PTS '9 = T}), and 
||0 2 ||st,et,Jtn[t/i rat ,minpt(et / )),s _ j 1 , 

||4fter[T,(/>]|| st ' e *'^ = T, iff ||t|| 9 G PERIODS and 

st,et,itn(maxpi(||T||s),i ias t],g _ 



||A/ter[0i,^2]|| s *' e *''*' 9 = 2 1 , iff for some ef 
et' G mxlpers({e \ \\4> 1 \\ st > e > PTS >9 = T}) and 
\\^»st,et,ltn(maxpt(et'),ti ast ],9 — J" 



If the first argument of Before is a term r, r must denote a period. The localisation time 
is required to end before the beginning of r's period. If the first argument of Before 
is a formula <j)\, the localisation time of fa is required to end before the beginning of 
a maximal event time period et' where fa holds. The After operator is similar. 



For example, ( 3.61[) is expressed as ( |3.62 ), and the reading of ( 3.63j ) that requires BA737 



to have departed after the end of a maximal period where the emergency system was 



in operation is expressed as ( 3.64 ). (I assume here that "to depart" is a point, as in 
the airport application.) 

(3.61) Was tank 2 empty before 25/9/95? 

(3.62) Before [25/9/95, Past [e v , empty (tank!)]] 

(3.63) BA737 departed after the emergency system was in operation. 

(3.64) After[Past[el v , in-operation(emerg_sys)], Past[e2 v , depart(ba737)]] 
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( 3.63| ) also has a reading where BA737 must have departed after the emergency system 



started to be in operation (section |2.10.2| ). To express this reading, we need the Begin 
operator of section |3. 12 below. 



Top's At, Before, and After operators are similar to those of [Pirie et al. 90|. The 
operators of Pirie et al., however, do not narrow It as in Top. Instead, they place 
directly restrictions on et. For example, ignoring some details, the After[</>i,fa] of 
Pirie et al. requires fa to hold at an event time et2 that follows an et\ where fa holds 
(both et\ and eti must fall within It). Instead, Top's After[fa, fa] requires et\ to be 
a maximal period where fa holds {et\ does not need to fall within the original It), and 
evaluates fa with respect to a narrowed It, which is the intersection of the original 
It with et\. In most cases, both approaches lead to similar results. Top's approach, 
however, is advantageous in sentences like ( |3.65| ), where one may want to express the 
reading whereby the tank was empty throughout 26/9/95 (section p. 9. 2 ). 



(3.65) Tank 2 was empty on 26/9/95. 

(3.66) At[26/9/95,Past[e v ,empty(tank2)]} 

In these cases one wants et (time where the tank was empty) to cover all the available 
time, where by "available time" I mean the part of the time-axis where the tense 
and the adverbial allow et to be placed. This notion of "available time" is captured 
by Top's It: the simple past and the "on 26/9/95" of ( 3.65| ) introduce At and Past 



operators that, assuming that ( |3.65[ ) is submitted after 26/9/95, cause It to become 
the period that covers exactly the day 26/9/95. The intended reading can be expressed 
easily in Top by including an additional operator that forces et to cover the whole It 
(this operator will be discussed in section 3.11| ). This method cannot be used in the 



language of Pirie et al. Their Past operator narrows the It to the part of the time-axis 
up to st, but their At does not narrow It any further; instead, it imposes a direct 
restriction on et (the semantics of Pirie et al.'s At is not very clear, but it seems that 
this restriction requires et to be a subperiod of 26/9/95). Hence, It is left to be the 
time-axis up to st, and one cannot require et to cover the whole It, because this would 
require the tank to be empty all the time from i/j rs t to st. 

The At operator of Pirie et al. also does not allow its first argument to be a formula, and 
it is unclear how they represent "while ..." clauses. Finally, Pirie et al.'s Before allows 



counter-factual uses of "before" to be expressed (section 2.10.2). Counter-factuals are 
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not considered in this thesis, and hence Pirie et al.'s Before will not be discussed any 
further. 

3.11 The Fills operator 



As discussed in section 2.9.2, when states combine with period adverbials, there is often 
a reading where the situation of the verb holds throughout the adverbial's period. For 
example, there is a reading of (|3.67| ) where tank 2 was empty throughout 26/9/95, not 
at simply some part of that day. 

(3.67) Tank 2 was empty on 26/9/95. 

Similar behaviour was observed in cases where states combine with "while ... " sub- 
ordinate clauses (section |2. 10. 1 ) . For example, there is a reading of ( 3,6§| ) whereby 



BA737 was at gate 2 throughout the entire inspection of UK160 by J.Adams, not at 
simply some time during the inspection. 

(3.68) BA737 was at gate 2 while J.Adams was inspecting UK160. 



It is also interesting that ( |3.69 ) cannot be understood as saying that tank 2 was empty 



throughout the period of "last summer". There is, however, a reading of ( |3,69| ) where 
tank 2 was empty throughout the August of the previous summer. 

(3.69) Tank 2 was empty in August last summer. 

It seems that states give rise to readings where the situation of the verb covers the 
whole available localisation time. ( p. 70 ) - ( 3.72j ) would express the readings of ( |3.67 ) - 



( 3.69| ) that are under discussion, if there were some way to force the event times of the 



predicates empty (tank2), 6e_at(6a737, gate2), and empty {tank2) to cover their whole 
localisation times. 

(3.70) At[26/9/95, Past [e v , empty (tank2)]] 

(3.71) At[Past[eV , inspecting^ _adams , ukl60)], Past [e2 v , 6e_at(6a737, gate2)]] 

(3.72) Part[august 9 , aug v ] A Part[summer 9 , sum", —1} A 
At[aug v , At[sum v , Past[e v , empty (tank2)]]] 



The Fills operator achieves exactly this: it sets et to the whole of It. For £ 
YN FORMS: 
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• \\Fills[^]\\ st ' et ' lt '9 = T, iff et = It and \m^,et,lt,g = t. 



The readings of JOtD - (^69| ) that are under discussion can be expressed as ( |3.73| ) 



( 3.75| ) respectively. 



(3.73) At[26/9/95, Past [e v , Fills [empty (tank2)}]} 

(3.74) At[Past[eV , inspecting^ _adams , ukldO)], 
Past[e2 v , Fills[be.at(ba737, gate2)])] 

(3.75) Part[august 9 , aug v ] A Part[summer 9 , sum", —1] 
At[aug v , At[sum v , Past[e v , Fills[empty(tank2)}]]] 

This suggests that when state expressions combine with period-specifying subordinate 
clauses or adverbials, the Nlitdb could generate two formulae, one with and one 
without a Fills, to capture the readings where et covers the whole or just part of It. 



As mentioned in section 2.9.2 , this approach (which was tested in one version of the 
prototype Nlitdb) has the disadvantage that it generates a formula for the reading 
where et covers the whole It even in cases where this reading is impossible. In time- 



asking questions like ( 3.76|) , for example, the reading where et covers the whole It 



(the whole 1994) is impossible, and hence the corresponding formula should not be 
generated. 

(3.76) When was tank 5 empty in 1994? 

Devising an algorithm to decide when the formulae that contain Fills should or should 
not be generated is a task which I have not addressed. For simplicity, the prototype 
Nlitdb and the rest of this thesis ignore the readings that require et to cover the 
whole It, and hence the Fills operator is not used. The Fills operator, however, may 
prove useful to other researchers who may attempt to explore further the topic of this 
section. 

3.12 The Begin and End operators 

The Begin and End operators are used to refer to the time-points where a situation 
starts or ends. For <p 6 YNFORMS: 

• \\Begin[<j)]\\ st > et > lt >9 = T, iff et C It 

et' € mxlpers({e e PERIODS \ \\<f)\\st,e,PTS,g = j 1 }) an d et = {minpt(et')} . 
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• \\End[(f))\\ st ' et ' lt ' 9 = T, iff et C It 

et' E mxlpers({e £ PERIODS \ \\(j)\\^ PT S,9 = t}) and et = {maxpt(et')}. 

Begin[<p] is true only at instantaneous event times et that are beginnings of maximal 
event times et' where <j> holds. The End operator is similar. 

The Begin and End operators can be used to express "to start", "to stop", "to begin", 
and "to finish" (section [2.6| ). For example, ( |3.77| ) is expressed as ( |3.78| ). Intuitively, 
in ( 3.78] ) the Culm[inspecting(j_adams,ukl60)] refers to an event-time period that 



covers exactly a complete inspection of UK160 by J.Adams (from start to completion). 
End[Culm[inspecting(j _adams, ukldO)]] refers to the end of that period, i.e. the com- 
pletion point of J.Adams' inspection. Begin[inspecting(t_smith,ba737)] refers to the 
beginning of an inspection of BA737 by T.Smith. The beginning of T.Smith's inspec- 
tion must precede the completion point of J.Adams' inspection, and both points must 
be in the past. 

(3.77) Did T.Smith start to inspect BA737 before J.Adams finished inspecting 
UK160? 

(3.78) Before[Past[el v , End[Culm[inspecting(j 'jadams , uM60)]]], 

Past[e2 v , Begin[inspecting(t_smith, ba737)]]] 



The reading of ( 3. 63] ) (section |3.10| ) that requires BA737 to have departed after the 



emergency system started to be in operation can be expressed as fl3.79|) . (The reading 
of ( 3.63j ) where BA737 must have departed after the system stopped being in operation 



is expressed as fl3.64p .) 

(3.79) After [Past[el v , Begin[in_operation(emergsys)}], Past[e2 v , depart(ba737)]] 



3.13 The Ntense operator 



The framework of this thesis (section |2.11| ) allows noun phrases like "the sales manager" 
in ( |3.80| ) to refer either to the present (current sales manager) or the time of the verb 
tense (1991 sales manager). The Ntense operator is used to represent these two possible 
readings. 

(3.80) What was the salary of the sales manager in 1991? 

(3.81) ?slr v Ntense[now* , manager _of(mgr v , sales)] A 
At[1991, Past[e v , salary _of{mgr v , slr v )]] 
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(3.82) ?slr v Ntense[e v , manager _of(mgr v , sales)] A 
At[1991, Past[e v , salary _of(mgr v , slr v )]} 

The reading of ( |3,80D where "the sales manager" refers to the present is represented 
as ( 3.81|) , while the reading where it refers to the time of the verb tense is represented 



as (|3.82j ). Intuitively, ( [3.81D reports any slr v , such that slr v was the salary of mgr v 
at some past time e v that falls within 1991, and mgr v is the manager of the sales 



department at the present. In contrast, (3.82) reports any slr v , such that slr v was the 



salary of mgr v at some past time e v that falls within 1991, and mgr v was the manager 
of the sales department at e v . Notice that in ( 3.82j ) the first argument of the Ntense 



is the same as the first argument of the Past, which is a pointer to the past event time 
where salary _of (mgr v , slr v ) is true (see the semantics of Past in section |3.8| ). 

For <P G YNFORMS and G VARS : 

• \\Ntense[(3, ^]\\^t,tt,g _ Tj iff f or some et > e PERIODS, it is true that g(J3) = et' 
and U\\stfit',PTS,9 = T . 



• \\Ntense[now*,(f)]\\ st ' et ' H ' 9 = T, iff ||0||^{st},PTS, 9 = T 

Ntense evaluates <j> with respect to a new event time et 1 , which may be different from 
the original event time et that is used to evaluate the part of the formula outside the 
Ntense. Within the Ntense, the localisation time is reset to PTS (whole time-axis) 
freeing et' from restrictions imposed on the original et. If the first argument of Ntense 
is now*, the new event time is the instantaneous period that contains only st, i.e. the 
object to which the noun phrase refers must have at st the property described by <f>. 
If the first argument of Ntense is a variable (3, the new event time et' can generally be 



any period, and (3 denotes et' . In ( 3.82 ), however, (3 is the same as the first argument 
of the Past, which denotes the original et that the Past requires to be placed before st. 
This means that manager _of \mgr v , sales) must hold at the same event time where 
salary _of(mgr v , slr v ) holds, i.e. the person mgr v must be the sales manager at the 
same time where the salary of mgr v is slr v . If the first argument of the Ntense in 



( 3.82j ) and the first argument of the Past were different variables, the answer would 



contain any 1991 salary of anybody who was, is, or will be the sales manager at any 



time. This would be useful in (3.83), where one may want to allow "Prime Minister" 



to refer to the Prime Ministers of all times, a reading that can be expressed as ( 3.84] ) . 
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(3.83) Which Prime Ministers were born in Scotland? 

(3.84) lpm v Ntense[eV ,pminister(pm v )] A Past[e2 v , birth Jn(pm v , Scotland)] 

The framework of this thesis, however, does not currently generate ( |3.84j ). (|3.83j ) would 
receive only two formulae, one for current Prime Ministers, and one for persons that 
were Prime Ministers at the time they were born (the latter reading is, of course, 
unlikely) . 

Questions like (|3.85|) and ( |3.87|) , where temporal adjectives specify explicitly the times 
to which the noun phrases refer, can be represented as ( p. 86 ) and ( 3.88] ). (The frame- 



work of this thesis, however, does not support temporal adjectives other than "current"; 
see section |2.8| .) 

(3.85) What was the salary of the current sales manager in 1991? 

(3.86) ?slr v Ntense[now* , manager _of(mgr v , sales)] A 
At[199l, Past[e v , salary _of(mgr v , slr v )]] 

(3.87) What was the salary of the 1988 sales manager in 1991? 

(3.88) ?slr v Ntense[el v , At[l988, manager _o j \mgr v , sales)]] A 
At[199l, Past[e v , salary _of(mgr v , slr v )]] 



The Ntense operator of Top is the same as the Ntense operator of | Crouch 91 1 and 
HCrouch fc Pulman 93 1. 



3.14 The For operator 

The For operator is used to express "for ..." and duration "in ..." adverbials (sections 
2]D| and |2~9lp . For a c G CPARTS, v qty G {1, 2, 3, ... }, and (ft G YNFORMS: 



• \\For[o- c ,v qty ,4>]\\ st ' et ' lt ' 9 =T, iff ||0|| s *> e *>^ =T, and for some PuP2 , ... ,p Vqty G 
fcparts(o~c), it is true that minpt( P i) = minpt(et), next{maxpt{p\)) = minpt( P 2), 
next{maxpt{p2)) = minpt{pz), ... , next(maxpt(p Uqty ~i)) = niinpt(p Uqty ), and 
maxpt(p Uqty ) = maxpt(et). 

For[a c ,u q t y ,(t)] requires <\> to be true at an event time period that is v qty <r c -periods 
long. For example, assuming that month denotes the partitioning of month-periods 
(the period that covers exactly the August of 1995, the period for September of 1995, 
etc.), ( 3.89Q can be expressed as ( |3.9C ). 
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(3.89) Was tank 2 empty for three months? 

(3.90) For[month c , 3, Past[e v , empty (tank2)]] 



( 3.90| ) requires an event time et to exist, such that et covers exactly three continuous 



months, and tank 2 was empty throughout et. As noted in section [2.9.3 , "for 



adverbials are sometimes used to specify the duration of a maximal period where a 
situation holds, or to refer to the total duration of possibly non-overlapping periods 
where some situation holds. The current version of Top cannot express such readings. 

Expressions like "one week", "three months", "two years", "two hours", etc., are often 
used to specify a duration of seven days, 3 x 30 days, 2 x 365 days, 2 x 60 minutes, 
etc. ( |3.90| ) expresses ([3.89 ) if "three months" refers to calendar months (e.g. from 



the beginning of a June to the end of the following August). If "three months" means 
3 x 30 days, ( |3.91|) has to be used instead. (I assume that day c denotes the partitioning 
of day-periods: the period that covers exactly 26/9/95, the period for 27/9/95, etc.) 

(3.91) For[day c , 90, Past [e v , empty (tank2)}} 



Assuming that "to inspect" is a culminating activity (as in the airport application), 
( 3,93j ) represents the reading of ( [3.92 ) where 42 minutes is the duration from the be- 



ginning of the inspection to the inspection's completion (section |2.9.4| ). ( [3. 93 ) requires 



et to cover the whole inspection (from beginning to completion), et to be in the past, 
and the duration of et to be 42 minutes. 

(3.92) J.Adams inspected BA737 in 42 minutes. 

(3.93) For [minute , 42, Past[e v , Culm[inspecting(j_adams, 6a737)]]] 



Unlike ( ^92|) , fl04j ) does not require the inspection to have been completed (section 



2.9. 3| ). ( [3.94 ) is represented as fl3.95| ), which contains no Culm. In this case, et must 
simply be a period throughout which J.Adams was inspecting BA737, it must be 
located in the past, and it must be 42 minutes long. 



(3.94) J.Adams inspected BA737 for 42 minutes. 

(3.95) For[minute c , 42, Past[e v , inspecting^ jadams , 6a737)]] 
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It 



Peif[e v , 0] 



et' et 
Figure 3.4: The Perf operator 



3.15 The Perf operator 

The Perf operator is used when expressing the past perfect. For example, ( |3.96 ) is 



expressed as ( 3.97 ). Perf could also be used to express the present perfect (e.g. (j^ 



could be represented as ( 3.99| )). This thesis, however, treats the present perfect in the 



same way as the simple past (section [2.5.4 ), and ( 3.98 ) is mapped to (|3.101|) , the same 
formula that expresses ( 3.100| ). 



(3.96) BA737 had departed. 

(3.97) Past[el v , Perf[e2 v ,depart(ba737)]] 

(3.98) BA737 has departed. 

(3.99) Pres[Perf[e v ,depart(ba737)]] 

(3.100) BA737 departed. 

(3.101) Past[e v ,depart(ba737)] 

For <p £ YNFORMS and (3 G VARS: 

• \\Perf\J3, cp]\\ st > et ' lt >9 = T, iff et C It, and for some et' G PERIODS, it is true that 
g{(3) = et', maxpt(et') -< minpt(et), and ||0|| si ' e *'. PT ^ = T. 

Perf {(3, (j)} holds at the event time et, only if et is preceded by a new event time et' 



where <f> holds (figure 3.4). The original et must be a subperiod of It. In contrast et' 
does not need to be a subperiod of It (the localisation time in ||^>|| st > e * / '- PTS '>9 is reset 
to PTS, the whole time-axis). The (3 of Perf [(3, (j)} is a pointer to et' , similar to the (3 
of Past[(3, (j)]. Ignoring constraints imposed by It, the event time et where Perf[f3, is 
true can be placed anywhere within the period that starts immediately after the end of 
et' {et' is where </> is true) and that extends up to ti as f. The informal term "consequent 



period" was used in section 2.9.1 to refer to this period 
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Using the Perf operator, the reading of Q3.102 ) where the inspection happens at some 



time before (or possibly on 27/9/95) is expressed as ( 3.103| ) (in this case, "on 27/9/95" 



provides a "reference time"; see section 2.5. 5| ) . In contrast, the reading of ( 3.102| ) where 



the inspection happens on 27/9/95 is expressed as (3.104). 



(3.102) J.Adams had inspected gate 2 on 27/9/95. 

(3.103) At[27/9/95,Past[el v ,Perf[e2 v , Culm[inspecting(ja, g2)]}]} 

(3.104) Past[el v ,Perf[e2 v ,At[27/9/95, Culm[inspecting(ja, g2)]}]] 



Let us explore formally the denotations of ( 3.103| ) and Q3.104| ). The denotation of 



(pJ33| ) w.r.t. st is T iff for some et G PERIODS and g G G, (pUOBD holds. 



(3.105) \\At[27/9/95,Past[er,Perf[e2 v , Culm[inspecting{ja, g2)]}]}\\ st < et ' PTS = T 

Assuming that 27/9/95 denotes the obvious period, by the definition of At, ( |3.105| ) 
holds iff flOg ) is true {PTS Pi f cons (27 / 9 / 95) = f cons {27 / 9 / 95)). 

(3.106) \\Past[el v , Perf[e2 v , Culm[inspecting{ja, g 2)}}]\\ st < et <fc°™( 27 / 9 / 95 )<9 = T 

By the definition of Past, ignoring el v which does not play any interesting role here, 
and assuming that st follows 27/9/95, (fU0|) is true iff fljUOTp holds. 



(3.107) \\Perf[e2 v , Culm[inspecting(ja, g 2)]}\\ st > et >fc°™( 27 / 9 / 95 )<9 = T 

By the definition of Perf (ignoring e2 v ), ( |3.107| ) holds iff for some et' G PERIODS, 

(3.108) , (3.109), and (3.110) hold. 

(3.108) etC/ cons (27/9/95) 

(3.109) maxpt(et') -< minpt(et) 

(3.110) \\Culm[inspecting{ja,g2)]\\ st ' et '' PTS ' 9 = T 

By the definition of Culm, (3.110) holds iff (3.111) - (3.115) hold. 

(3.111) et' C PTS 

(3.112) fcuims(inspecting,2)(f cons (ja)J cons (g2)) =T 

(3.113) S= |J p 

P^f P funs(inspecting,2)(f cona (ja)J conB (g2)) 

(3.114) 5/0 

(3.115) et' = [minpt(S) , maxpt(S)] 
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I S = et ' I k^h f 



27/9/95 

Figure 3.5: First reading of "J.Adams had inspected gate 2 on 27/9/95" 

Let us assume that there is only one maximal period where J.Adams is inspecting 
BA737, and that the inspection is completed at the end of that period. Then, the S 
of (3.113) is the maximal period, and (3.112) and (3.114) hold. (3.115) requires et' to 
be the same period as S, in which case (3.111) is trivially satisfied. The denotation of 
( 3. 103[ ) w.r.t. st is T (i.e. the answer to (|3.102|) is affirmative) iff for some et, et' = S, 



and (3.108) and (3.109) hold, i.e. iff there is an et within 27/9/95, such that et follows 
S (S = et' is the period that covers the whole inspection). The situation is depicted 
in figure |3.5| . In other words, 27/9/95 must contain an et where the inspection has 
already been completed. 

Let us now consider ( [3,1041 ). Its denotation w.r.t. st will be true iff for some et S 
PERIODS and g e G, (|37TT6|) holds. 

(3.116) \\Past[el v , Perf[e2 v , At[27/ 9 / 95 , Culm[inspecting(ja, g2)]]]]\\ st > et > PTS = T 



By the definition of Past, ( 3.116 ) holds iff ( |3.117 ) is true. (For simplicity, I ignore 
again e\ v and e2 v .) 

(3.117) \\Perf[e2 v ,At[27/9/95, Culm[inspecting(ja, g 2)]]]\\ at M,[tfirst,st),9 

By the definition of Perf, (pT7| ) is true iff for some et' £ PERIODS, (3.118), (3.119), 
and (3.120) hold. 

(3.118) et C [t fir8t , st) 

(3.119) maxpt(et') -< minpt(et) 

(3.120) \\At[27 / 9 / 95 , Culm[inspecting(ja, g2)]]\\ st ' et ' ' PTS ' 9 = T 



By the definition of the At operator, (3.120) holds iff (|3.121|) holds. (I assume again 
that 27/9/95 denotes the obvious period.) 

(3.121) \\Culm[inspecting(ja,g2)]\\ st ' et ' J ^ 27 / 9 / 95 ^ 9 = T 
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27/9/95 

Figure 3.6: Second reading of "J.Adams had inspected gate 2 on 27/9/95 



By the definition of Culm, (|3.121|) holds iff (3.122) - (3.126) are true. 



(3.122) et' Qf cons {27/9/95) 

(3.123) fcuims(mspecting,2)(f cons (ja)J cons (g2)) =T 

(3.124) S= |J p 

p£f P funs(inspecting,2)(f cona {ja)J cons {g2)) 

(3.125) 5/0 

(3.126) et' = [minpt(S) , maxpt(S)] 

Assuming again that there is only one maximal period where J.Adams is inspecting 
BA737, and that the inspection is completed at the end of that period, the S of (3.124) 
is the maximal period, and (3.123) and (3.125) hold. (3.126) requires et' to be the same 
as S. The denotation of (3.104) w.r.t. st is T (i.e. the answer to ( 3.102| ) is affirmative) 



iff for some et, et' = S, and (3.118), (3.119), and (3.122) hold. That is there must be 
some past et that follows S (S = et' is the period that covers the whole inspection), 
with S falling within 27/9/95 (figure |375| ) . The inspection must have been completed 
within 27/9/95. 

In fl3.127j ), where there are no temporal adverbials, the corresponding formula ( |3.12S| ) 
requires some past et (pointed to by el v ) to exist, such that et follows an et' (pointed 
to by e2 v ) that covers exactly the whole (from start to completion) inspection of gate 
2 by J.Adams. The net effect is that the inspection must have been completed in the 
past. 

(3.127) J.Adams had inspected gate 2 

(3.128) Past [el v , Perf [e2 v , Culm[inspecting(ja, g2)}\\ 



As noted in section |2.5.5| , there is a reading of fl3.128| ) (probably the preferred one) 
whereby the two-year period ends on 1/1/94, i.e. J.Adams was still a manager on 
1/1/94. Similarly, there is a reading of (|3.130 ), whereby the two-year period ends at 
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st, i.e. J.Adams is still a manager (section |2.5.4| ). These readings cannot be captured 
in Top. 

(3.129) On 1/1/94, J.Adams had been a manager for two years. 

(3.130) J.Adams has been a manager for two years. 

For example, ( |3. 131| ) requires some past et (pointed to by el v ) to exist, such that et 
falls within 1/1/94, et follows a period et' (pointed to by e2 v ), et' is a period where 
J.Adams is a manager, and the duration of et' is two years. If, for example, J.Adams 
was a manager only from 1/1/88 to 31/12/89, (|3.131|) causes the answer to (|3.129| ) to 



be affirmative. ( 3.131 ) does not require the two-year period to end on 1/1/94. 
(3.131) At[l /I m , Past[el v , Perf[e2 v , For[year c , 2, be(ja, manager)]]]] 

Various versions of Perf operators have been used in [Dowty 82 1, [Richards et al. 89 



Pirie et al. 90| ], JCrouch fc Pulman 93 1, and elsewhere. 



3.16 Occurrence identifiers 

Predicates introduced by verbs whose base forms are culminating activities often have 
an extra argument that acts as an occurrence identifier. Let us consider a scenario 
involving an engineer, John, who worked on engine 2 repairing faults of the engine at 
several past times (figure |3,7[ ) . John started repairing a fault of engine 2 on 1/6/92 at 
9:00am. He continued to work on this fault up to 1:00pm on the same day, at which 
point he temporarily abandoned the repair without completing it. He resumed the 
repair at 3:00pm on 25/6/92, and completed it at 5:00pm on the same day. 

In 1993, John was asked to repair another fault of engine 2. He started the repair on 
1/7/93 at 9:00am, and continued to work on that fault up to 1:00pm on the same day 
without completing the repair. He then abandoned the repair for ever (John was not 
qualified to fix that fault, and the repair was assigned to another engineer). Finally, 
in 1994 John was asked to repair a third fault of engine 2. He started to repair the 
third fault on 1/6/94 at 9:00am, and continued to work on that fault up to 1:00pm on 
the same day, without completing the repair. He resumed the repair at 3:00pm, and 
completed it at 5:00pm on the same day. 
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1/6/92 25/6/92 1/7/93 1/6/94 

Pi P2 ?3 ?4 p 5 
1 I — I I 1 I — I I 5 1 



9:00am 1:00pm 3:00pm 5:00pm 9:00am 1:00pm 9:00am 1:00pm 3:00pm 5:00pm 

repairing(occl,john,eng2) repairing(occl,john,eng2) repairing(occ2,john,eng2) repairing(occ3,john,eng2) repairing(occ3,john,eng2) 

i 1 , h 1 



Culm[repairing(occ 1 ,john,eng2)] Culm[repairing(occ3,john,eng2)] 

, h 

Culm[repairing(john,eng2)] 



Figure 3.7: Occurrence identifiers 



There is a problem if (|3,132|) is represented as ( |3.133j ) . Let us assume that the question 
is submitted after 1/6/94. One would expect the answer to be affirmative, since a 
complete past repair of engine 2 by John is situated within 1/6/94. In contrast, ( |3.133j ) 
causes the answer to be negative. The semantics of Culm (section |3.9|) requires et to 
start at the beginning of the earliest maximal period where repairing(john, eng2) 
holds (i.e. at the beginning of p\ in figure |3.7[ ) and to end at the end of the latest 
maximal period where repairing(john, eng2) holds (i.e. at the end of ps in figure |3.7| ). 
That is, et must be p% of figure |3.7| . The At requires et (jp%) to be also a subperiod of 
1/6/94. Since this is not the case, the answer is negative. 

(3.132) Did John repair engine 2 on 1/6/94? 

(3.133) At[l / 6 / 94 , Past[e v , Culm[repairing(john, eng2)]]] 



The problem is that although John was repairing engine 2 during all five periods (pi, 
P2, P3, Pi, and P5), the five periods intuitively belong to different occurrences of the 
situation where John is repairing engine 2. The first two periods have to do with the 
repair of the first fault (occurrence 1), the third period has to do with the repair of the 
second fault (occurrence 2), and the last two periods relate to the repair of the third 
fault (occurrence 3). The Culm[repairing(john, eng2)] of (3.133), however, does not 
distinguish between the three occurrences, and forces et to start at the beginning of 
pi and to end at the end of p§. Instead, we would like Culm[repairing(john, eng2)] to 
distinguish between the three occurrences: to require et to start at the beginning of p\ 
(beginning of the first repair) and to end at the end of P2 (completion of the first repair), 
or to require et to start at the beginning of P4 (beginning of the third repair) and to 
end at the end of p§ (completion of the third repair). (Culm[repairing(john,eng2)] 
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should not allow et to be p%, because the second repair does not reach its completion 
at the end of ^3.) 

To achieve this, an occurrence-identifying argument is added to fixing(john, eng2). If 
occl, occ2, and occi denote the three repairing-occurrences, fixing(occl,john,eng2) 
will be true only at ets that are subperiods of p\ or p%, fixing (occ2, john,eng2) only 
at ets that are subperiods of ps, and fixing(occ3, john, eng2) only at ets that are 
subperiods of p^ or p$. In practice, the occurrence- identifying argument is always a 
variable. For example, ( |3,132| ) is now represented as ([3.134 ) instead of (|3.133j ). 



(3.134) At[l / 6 / 94 , Past[e v , Culm[repairing(occ v , john, eng2)}]] 



Intuitively, according to (3.134) the answer should be affirmative if there is an et and a 
particular occurrence occ v of the situation where John is repairing engine 2, such that 
et starts at the beginning of the first period where occ v is ongoing, et ends at the end 
of the last period where occ v is ongoing, occ v reaches its completion at the end of et, 
and et falls within the past and 1/6/94. Now if ( |3.132| ) is submitted after 1/6/94, the 
answer is affirmative. 

To see that ( p. 134] ) generates the correct result, let us examine the denotation of 



(3.134). The denotation of (3.134) w.r.t. st is affirmative if for some et G PERIODS 



and g G G, ( |3.135| ) holds. 

(3.135) \\At[l / 6 / 94 , Past[e v , Culm[repairing(occ v ,john,eng2)]]}\\ st ' et ' PTS 'S = T 

Assuming that the question is submitted after 1/6/94, and that 1/6/94 denotes the 
obvious period, by the definitions of At and Past, ( |3.135[ ) holds iff (3.136) and (3.137) 
hold. 

(3.136) g(e v ) = et 

(3.137) \\Culm[repairing(occ v ,john,eng2)]\\ st ' etJmn ^ 1/6/94) ^ = T 



By the definition of Culm, (3.137) holds iff (3.138) - (3.141) hold, where S is as in 
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(3.142). 

(3.138) etQf cons (l/6/94) 

(3.139) fculms(repairing, 2)(g(occ v )J cons (john)J cons (eng2)) = T 

(3.140) 5^0 

(3.141) et = [minpt(S),maxpt(S)] 

(3.142) 5= |J p 

p£fpfuns(repairing,2)(g(occ v )J cons (john)J cons (eng2)) 



The denotation of (3.134) w.r.t. st is T (i.e. the answer to ( |3. 132 ) is affirmative), iff for 



some et G PERIODS and g G G, (3.136) and (3.138) - (3.141) hold. For et as in (3.143) 
and g the variable assignment of (3.144), (3.136) and (3.138) hold. (3.142) becomes 
(3.145), and (3.140) holds. (3.141) becomes (3.143), which holds (et was chosen to 
satisfy it). (3.139) also holds, because the third repair is completed at the end of p$. 

(3.143) et = [minpt{p±),maxpt(jp§j\ =p-j 

(3.144) 

(et if(3 = e v 

9(fl) = S fcons(occ3) if /? = occ v 

I o otherwise (o is an arbitrary element of OBJS) 

(3.145) 5 = p 4 Up 5 

Hence, there is some et G PERIODS and g G G for which (3.136) and (3.138) - (3.141) 



hold, i.e. the answer to ( 3.132 ) will be affirmative as wanted. 



Occurrence identifiers are a step towards formalisms that treat occurrences of situa- 



tions (or "events" or "episodes") as objects in the modelled world (e.g. Parsons 90 ] 



RKamp fc Reyle 93| , []Blackburn et al. 94fl , jHwang fc Schubert 94|1 ). In Top all terms 



(constants and variables) denote elements of OBJS, i.e. objects of the modelled world. 
Thus, allowing occurrence-identifying terms (like occ v in (13.134|) ) implies that occur- 



rences of situations are also world objects. Unlike other formalisms (e.g. those men- 
tioned above), however, Top does not treat these occurrence- identifying terms in any 
special way, and there is nothing in the definition of Top to distinguish objects denoted 
by occurrence-identifiers from objects denoted by other terms. 
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Although tense anaphora (section |2.12j ) was not considered during the work of this 
thesis, it seems that Top's localisation time could prove useful if this phenomenon 



were to be supported. As noted in section [2.1% some cases of tense anaphora can 
be handled by storing the temporal window established by adverbials and tenses of 
previous questions, and by requiring the situations of follow-up questions to fall within 
that window. Top's It can capture this notion of previous window. Assuming that 



( 3.146 ) is submitted after 1993, the At and Past operators of the corresponding formula 

( 3.147 ) narrow It to the period that covers exactly 1993. This period could be stored, 
and used as the initial value of It in ( |3.149| ), that expresses the follow-up question 
(|3l48D . In effect, (|T4|) would be taken to mean ( [050) ). 



(3.146) Was Mary the personnel manager in 1993? 

(3.147) ^4i[1993, Past[e v ,manager_of(mary, personnel)]] 

(3.148) Who was the personnel manager? 

(3.149) ?wh v Past[e v , manager _of{wh v , personnel)] 

(3.150) Who was the personnel manager in 1993? 



Substantial improvements are needed to make these ideas workable. For example, if 
( 3.146 ) and ( 3.148 ) are followed by ( 3. 151|) (expressed as ( |3. 152j) ) , and the dialogue 



takes place after 1993, the Nlitdb must be intelligent enough to reset It to the whole 
time axis. Otherwise, no person will ever be reported, because the Pres of ( |3.152| ) 
requires et to contain st, and an et that contains st can never fall within the past year 
1993 (the It of the previous question). 

(3.151) Who is (now) the personnel manager? 

(3.152) ?wh v Pres[manager_of (wh v , personnel)] 

3.18 Expressing habituals 



As noted in section |3.7| , habitual readings of sentences are taken to involve habitual 
homonyms of verbs. Habitual homonyms introduce different predicates than the cor- 
responding non-habitual ones. For example, ( |3,153j ) and ( 3.155 ) would be expressed 



as ( [3.154 ) and ( |3.156| ) respectively. Different predicates would used in the two cases. 
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(3.153) Last month BA737 (habitually) departed from gate 2. 

(3.154) Part[month c ,mon v , — 1] A 

At[mon v , Past[e v , hab_depart-from(ba737, gate2)]] 

(3.155) Yesterday BA737 (actually) departed from gate 2. 

(3.156) Part[day c , y v ,—l] A 

At[y v , Past[e v , actl_depart_from(ba737,gate2)]] 



hab_depart-from(ba737, gate2) is intended to hold at ets that fall within periods where 
BA737 has the habit of departing from gate 2. If BA737 departed habitually from gate 
2 throughout 1994, hab-depart-from(ba737, gate2) would be true at any et that is a 
subperiod of 1994. In contrast, actl-depart_from(ba737, gate2) is intended to hold only 
at ets where BA737 actually departs from gate 2. If departures are modelled as instan- 
taneous (as in the airport application), actl_depart-from(ba737,gate2) is true only at 
instantaneous ets where BA737 leaves gate 2. One would expect that if BA737 had the 
habit of departing from gate 2 during some period, it would also have actually departed 
from gate 2 at least some times during that period: if hab_depart-from(ba737, gate2) 
is true at an et, actl_depart_from(ba737, gate2) would also be true at some subperi- 
ods et' of et. There is nothing in the definition of Top, however, to guarantee that 
this implication holds. The event times where hab-depart-from(ba737, gate2) and 
actLdepart_from(ba737, gate2) hold are ultimately determined by f p f uns (that speci- 
fies the maximal periods where the two predicates hold; see section |3.4| ). There is no 
restriction in the definition of Top to prohibit whoever defines f p f uns from specifying 
that hab-depart-from(ba737,gate2) is true at some et that does not contain any et' 
where actl-depart-from(ba737,gate2) is true. 



Another issue is how to represent ( p.157 ). ( 3.157| ) cannot be represented as ( |3.15£ ). 



(3.158) says that at 5:00pm on some day in the previous month BA737 had the habit 



of departing. I have found no elegant solution to this problem. ( |3.157 ) is mapped to 



(3.159), where the constant 5:00pm is intended to denote a generic representative of 



5:00pm-periods. This generic representative is taken to be an entity in the world. 

(3.157) Last month BA737 (habitually) departed at 5:00pm. 

(3.158) Part[month c ,mon v , -1] A Part[5:00pm 9 , fv v ] A 
At[mon v , At[fv v , Past[e v , hab.depart(ba737)]] 

(3.159) Part[month c ,mon v , — 1] A 

At[mon v , Past[e v , hab_depart-time(ba737 ', 5:00pm)]] 
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Unlike ( p.!57| ), where "at 5:00pm" introduces a constant (5:00pm) as a predicate- 
argument in ( |3.159| ), the "at 5:00pm" of ( 3.160| ) introduces an At operator in ( |3. 161 ) 



(3.160) Yesterday BA737 (actually) departed at 5:00pm. 

(3.161) Part[day c , y v , -1] A Part[5:00pm° , fv v ] A 
At[y v , At[fv v , Past[e v , actLdepart(ba737)}]} 



The fact that "at 5:00pm" is treated in such different ways in the two cases is admittedly 
counter-intuitive, and it also complicates the translation from English to Top (to be 
discussed in chapter 0) . 



3.19 Summary 

Top is a formal language, used to represent the meanings of the English questions that 
are submitted to the Nlitdb. The denotation with respect to st of a Top formula 
specifies what the answer to the corresponding English question should report (st is 
the time-point where the question is submitted to the Nlitdb). The denotations with 
respect to st of Top formulae are defined in terms of the denotations of Top formulae 
with respect to st, et, and It. et (event time) is a time period where the situation 
described by the formula holds, and It (localisation time) is a temporal window within 
which et must be placed. 

Temporal linguistic mechanisms are expressed in Top using temporal operators that 
manipulate st, et, and It. There are thirteen operators in total. Part picks a period 
from a partitioning. Pres and Past are used when expressing present and past tenses. 
Perf is used in combination with Post to express the past perfect. Culm is used to 
represent non-progressive forms of verbs whose base forms are culminating activities. 
At, Before, and After are employed when expressing punctual and period adverbials, 
and when expressing "while ... ", "before ... ", and "after ... " subordinate clauses. 
Duration "in ... " and "for ... " adverbials are expressed using For. Fills can be 
used to represent readings of sentences where the situation of the verb covers the 
whole localisation time; Fills, however, is not used in the rest of this thesis, nor in the 
prototype Nlitdb. Begin and End are used to refer to time-points where situations 
start or stop. Finally, Ntense allows noun phrases to refer either to st or to the time 
of the verb's tense. 



Chapter 4 



From English to TOP 



11 One step at a time." 

4.1 Introduction 



This chapter shows how Hpsg [Pollard Sz Sag 87 1 [Pollard & Sag 94] was modified to 



map English questions directed to a Nlitdb to appropriate Top formulae.^ Although 
several modifications to HPSG were introduced, the HPSG version of this thesis remains 



very close to [Pollard & Sag 94]. The main differences from [Pollard & Sag 94] are 
that: (a) HPSG mechanisms for phenomena not examined in this thesis (e.g. pronouns, 
relative clauses) were removed, and (b) the situation-theoretic semantic constructs of 
Hpsg were replaced by feature structures that represent Top expressions. 

Readers with a rudimentary grasp of modern unification-based grammars [Shieber 86] 



should be able to follow most of the discussion in this chapter. Some of the details, 
however, may be unclear to readers not familiar with Hpsg. The Hpsg version of this 
thesis was implemented as a grammar for the Ale system (see chapter ^) . 

4.2 HPSG basics 

In Hpsg, each word and syntactic constituent is mapped to a sign, a feature structure 

of a particular form, that provides information about the word or syntactic constituent. 

An Hpsg grammar consists of signs for words (I call these lexical signs), lexical rules, 

1 The Hpsg versio n of this thesis is based on the revised Hpsg version of chapter 9 of 
[Pollard fc Sag 94 
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4.2.1 Lexical signs and sort hierarchy 



Lexical signs provide information about individual words. (Words with multiple uses 
may receive more than one lexical sign.) ( |4.1| ) shows a lexical sign for the base form 
of "to land" in the airport domain. 



(4.1) 



phon (land) 



SYNSEM 



LOC 



CAT 



VFORM bse 
AUX 



verb 

ASPECT culmact 



SPR 
SUBJ 

COMPS 





{ NP 



-PRD 



-PRD, PFORM On 



m ) 



CONT 



landing_on 



ARGl occrjuar 
ARG2 Q] 
ARG3 H 



The < and > delimiters denote lists. The phon feature shows the list of words to 
which the sign corresponds ((|4.1|) corresponds to the single word "land"). Apart from 



PHON, every sign has a synsem feature (as well as other features not shown in (4.1); 
I often omit features that are not relevant to the discussion). The value of synsem 
in ( [4.1| ) is a feature structure that has a feature LOC. The value of LOC is in turn a 
feature structure that has the features CAT (intuitively, syntactic category) and CONT 
(intuitively, semantic content). 

Each Hpsg feature structure belongs to a particular sort. The sort hierarchy of Hpsg 
shows the available sorts, as well as which sort is a subsort of which other sort. It also 
specifies which features the members of each sort must have, and the sorts to which 
the values of these features must belong. (Some modifications were made to the sort 
hierarchy of [Pollard & Sag 94]. These will be discussed in sections 4.3 and |4.4| .) In 
(4.1), for example, the value of HEAD is a feature structure of sort verb. The value of 
head signals that the word is the base form (vform bse) of a non- auxiliary (aux — ) 
verb. The sort hierarchy of [Pollard &: Sag 94 1 specifies that the value of head must 
be of sort head, and that verb is a subsort of head. This allows feature structures of 



sort verb to be used as values of head. The value of vform in is an atomic 
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feature structure (a feature structure of no features) of sort bse. For simplicity, when 
showing feature structures I often omit uninteresting sort names. 

aspect is the only new Hpsg feature of this thesis. It is a feature of feature structures 
of sort cat (feature structures that can be used as values of cat), and its values are 
feature structures of sort aspect, aspect contains only atomic feature structures, and 
has the subsorts: state, activity, culmact (culminating activity), and point, state is 
in turn partitioned into: lexstate (lexical state), progressive (progressive state), and 
cnsqstate (consequent state). This agrees with the aspectual taxonomy of chapter [2]. 
Following table ^j] on page |2(| (|4.1| ) classifies the base form of "to land" as culminating 
activity. 



The SPR, SUBJ, and COMPS features of (4.1) provide information about the specifier, 



subject, and complements with which the verb has to combine. Specifiers are deter- 
miners (e.g. "a", "the"), and words like "much" (as in "much more") and "too" (as in 
"too late"). Verbs do not admit specifiers, and hence the value of SPR in (|4.1| ) is the 
empty list. 

The SUBJ value of ([4.1D means that the verb requires a noun-phrase as its subject. 



The NP[-PRD]rjj in ( |4.1| ) has the same meaning as in Pollard fc Sag 94 ] . Roughly 
speaking, it is an abbreviation for a sign that corresponds to a noun phrase. The -PRD 
means that the noun phrase must be non-predicative (see section [4.9| below). The is 
intuitively a pointer to the world entity described by the noun phrase. Similarly, the 
COMPS value of ( |4.1| ) means that the verb requires as its complement a non-predicative 
prepositional phrase (section [4.8| below), introduced by "on". The H] is intuitively a 
pointer to the world entity of the prepositional phrase (e.g. if the prepositional phrase 
is "on a runway", the [H is a pointer to the runway). 



The value of CONT in (4.1) represents the Top predicate landing _on(/3, ti, T2), where 
T\ and T2 are Top terms corresponding to CD and [2], and (3 is a Top variable acting 
as an occurrence identifier (section p.!6| ).p| The exact relation between Hpsg feature 
structures and Top expressions will be discussed in the following sections. 



2 I follow the approach of section 8.5.1 of [Pollard & Sag 94], whereby the relation feature is dropped, 
and its role is taken up by the sort of the feature structure. 
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Lexical rules generate new lexical signs from existing ones. In section |4.7| , for example, 
I introduce lexical rules that generate automatically lexical signs for (single-word) non- 
base verb forms (e.g. a sign for the simple past "landed") from signs for base forms (e.g. 
(|4.l| )). This reduces the number of lexical signs that need to be listed in the grammar. 



4.2.3 Schemata and principles 

Hpsg schemata specify basic patterns that are used when words or syntactic con- 
stituents combine to form larger constituents. For example, the head- complement 
schema is the pattern that is used when a verb combines with its complements (e.g. 
when "landed" combines with its complement "on runway 2"; in this case, the verb 
is the "head-daughter" of the constituent "landed on runway 2"). The head-subject 
schema is the one used when a verb phrase (a verb that has combined with its comple- 
ments but not its subject) combines with its subject (e.g. when "landed on runway 2" 
combines with "BA737"; in this case, the verb phrase is the head-daughter of "BA737 



landed on runway 2"). No modifications to the schemata of [ Pollard fc Sag 94 ] are 



introduced in this thesis, and hence schemata will not be discussed further. 

Hpsg principles control the propagation of feature values from the signs of words 
or syntactic constituents to the signs of their super-constituents. The head feature 
principle, for example, specifies that the sign of the super-constituent inherits the 
head value of the head-daughter's sign. This causes the sign of "landed on runway 2" 
to inherit the HEAD value of the sign of "landed", and the same value to be inherited by 
the sign of "BA737 landed on runway 2". This thesis uses simplified versions of Pollard 
and Sag's semantics principle and constituent ordering principle (to be discussed in 
sections 1.9.1 and 4.13| ), and introduces one new principle (the aspect principle, to be 



discussed in section [4. 11.1 ). All other principles are as in pollard fc Sag 94fl . 



4.3 Representing TOP yes/no formulae in HPSG 



According to 



[ Pollard fe Sag 94fl , the CONT value of ( fO]) should actually be (|4~2j ). 
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psoa 



predicate 



operator 



circling 



ARGl ind 

landing_on 



ARGl occrjuar 
ARG2 ind 
arg3 ind 



pres 



MAIN_PSOA psoa 



Figure 4.1: Subsorts of psoa 



(4.2) 



QUANTS (} 



NUCLEUS 



psoa 



landing_on 



ARG 1 occr_var 
ARG2 Q] 

arg3 m 



In [Pollard & Sag 94 1, feature structures of sort psoa have two features: QUANTS and 
nucleus]^] QUANTS, which is part of Hpsg's quantifier storage mechanism, is not used 
in this thesis. This leaves only one feature (nucleus) in psoas. For simplicity, nucleus 
was also dropped, and the psoa sort was taken to contain the feature structures that 
would be values of nucleus in [ pollard k. Sag 94 ], 



More precisely, in this thesis psoa has two subsorts: predicate and operator (figure 



4.1). predicate contains feature structures that represent Top predicates, while opera- 
tor contains feature structures that represent all other Top yes/no formulae. (Hence, 
psoa corresponds to all yes/no formulae.) predicate has domain-specific subsorts, corre- 
sponding to predicate functors used in the domain for which the Nlitdb is configured. 
In the airport domain, for example, landing_on is a subsort of predicate. The feature 
structures in the subsorts of predicate have features named ARGl, ARG2, arg3, etc. 
These represent the first, second, third, etc. arguments of the predicates. The values 
of ARGl, ARG2, etc. are of sort ind (occr-var is a subsort of ind). ind will be discussed 
further below. 



The operator sort has thirteen subsorts, shown in figure |4.2| . These correspond to the 
twelve Top operators (Fills is ignored), plus one sort for conjunction.^ The order of the 
features in figure rL2 corresponds to the order of the arguments of the Top operators. 



3 "Psoa" stands for "parameterised state of affairs", a term from situation theory | Cooper et al. 90 1 
The seman tic analysis here is not situation-theoretic, but the term "psoa" is still used for compati- 



bility with |Pollard fc Sag 94 



The sorts that correspond to the At, Before, After, and For operators are called at_op, before_op, 
after_op, and /or_op to avoid name clashes with existing Hpsg sorts. 
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pres 

past 

perf 
at_op 
before_op 

after_op 



MAIN_PSOA psoa 



ET_HANDLE 
MAIN_PSOA 
ET_HANDLE 
MAIN_PSOA 

TIME_SPEC 
MAIN_PSOA 

TIME_SPEC 
MAIN_PSOA 

TIME_SPEC 
MAIN_PSOA 



TVAR + 



TVAR + 



temp_ent 
psoa 

temp_ent 
psoa 

temp_ent V psoa 
psoa 

temp_ent V psoa 
psoa 

temp_ent V psoa 
psoa 



culm 



and 



begin 



MAIN_PSOA predicate 

CONJUNCTl psoa 
CONJUNCT2 psoa 



MAIN_PSOA psoa 



part 



ntense 



for_op 



MAIN_PSOA psoa 

end 

ET_HANDLE UOW V 



TVAR + 



PAR.TNG compLpartng V 



gappyjpartng 



temp_ent 

MAIN_PSOA psoa 

DUR_UNIT compLpartng 

DURATION TYAI! 

semjnum 
MAIN_PSOA psoa 



TVAR 



PART.VAR 



temp_ent 



TVAR 



Figure 4.2: Subsorts of operator 



For example, the et_handle and main_PSOA features of the past sort correspond to 
the first and second arguments respectively of Top's Past[/3, (j)]. For simplicity, in the 
rest of this thesis I drop the Part[a, (3, v rd] version of Part (section and I represent 
words like "yesterday" using Top constants (e.g. yesterday) rather than expressions 



like Part[day c , (3, —1]. This is why there is no sort for Part[a, (3, v Q rd\ i n figure 4.2. 

In [ Pollard Sz Sag 94[ , feature structures of sort ind (called indices) have the features 
person, number, and gender, which are used to enforce person, number, and gender 
agreement. For simplicity, these features are ignored here, and no agreement checks 
are made. Pollard and Sag's subsorts of ind (ref, there, it), which are used in Hpsg's 
binding theory, are also ignored here. In this thesis, indices represent Top terms 
(they also represent gappy partitioning names, but let us ignore this temporarily). 



The situation is roughly speaking as in figure 4.3. For each Top constant (e.g. 6a737, 
gate2), there is a subsort of ind that represents that constant. There is also a subsort 
var of ind, whose indices represent Top variables. A TVAR feature is used to distinguish 
indices that represent constants from indices that represent variables. All indices of 
constant-representing sorts (e.g. ba 737, ukl60) have their TVAR set to — . Indices of 
var have their TVAR set to +. 
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TVAR. — 




TVAR. — 




TVAR — 




TVAR + 


ba737 




ukl60 




gate2 




var 





Figure 4.3: ind and its subsorts - simplified version 



The fact that there is only one subsort (var) for Top variables in figure 4.3 does not 
mean that only one Top variable can be represented, var is a sort of feature structures, 
containing infinitely many feature-structure members. Although the members of var 
cannot be distinguished by their feature values (they all have TVAR set to +), they 
are still different; i.e. they are "structurally identical" but not "token-identical" (see 
chapter 1 of [Pollard & Sag 94]). Each one of the feature-structure members of var 



represents a different Top variable. The subsorts that correspond to Top constants 
also contain infinitely many different feature-structure members. In this case, however, 
all members of the same subsort are taken to represent the same constant. For example, 
any feature structure of sort gate2 represents the Top constant gate2. 

4.4 More on the subsorts of ind 



The subsorts of ind are actually more complicated than in figure 4.5. Natural lan 



guage front-ends (e.g. Masque JAuxerre fc Inder 86], Team prosz et al. 87f, Cle 



| |Alshawi 92 ], SystemX [ Cercone et al. 93fl ) often employ a domain-dependent hierar- 



chy of types of world entities. This hierarchy is typically used in disambiguation, and 
to detect semantically anomalous sentences like "Gate 2 departed from runway 1". 
Here, a hierarchy of this kind is mounted under the ind sort. (Examples illustrating 
the use of this hierarchy are given in following sections.) 

In the airport domain, there are temporal world entities (the Monday 16/10/95, the 
year 1995, etc.), and non-temporal world entities (flight BA737, gate 2, etc.). Indices 
representing temporal entities are classified into a temp_ent subsort of ind, while in- 



dices representing non-temporal entities are classified into non_temp-ent (see figure 4.4; 
ignore partng and its subsorts for the moment). non_temp_ent has in turn subsorts 
like mass (indices representing mass entities, e.g. foam or water), flight_ent (indices 
representing flights, e.g. BA737), etc. flight_ent has one subsort for each Top constant 
that denotes a flight (e.g. ba!31, uk!60), plus one sort (flight-ent-var) whose indices 
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represent Top variables that denote nights. The other children-sorts of non_temp_ent 
have similar subsorts. 

temp^ent has subsorts like minute_ent (indices representing particular minutes, e.g. 
the 5:00pm minute of 1/1/91), day_ent (indices representing particular days), etc. 
minute-ent has one subsort for each Top constant that denotes a particular minute 
(e.g. 5:00pml/l/91), plus one sort (minute-ent-var) whose indices represent Top vari- 
ables that denote particular minutes. The other children-sorts of temp_ent have similar 
subsorts. The indices of other _temp-entji)ar (figure [O]) represent Top variables that 
denote temporal-entities which do not correspond to sister-sorts of other-temp _ent-var 
(minute-ent, day^ent, etc.). This is needed because not all Top variables denote par- 
ticular minutes, days, months, etc. In (|4.3| ), for example, e v denotes a past period that 
covers exactly a taxiing of UK160 to gate 1 (from start to completion). The taxiing 
may have started at 5:00pm on 1/1/95, and it may have been completed at 5:05pm on 
the same day. In that case, e v denotes a period that is neither a minute-period, nor a 
day-period, nor a month-period, etc. 

(4.3) Past[e v , Culm[taxiingJ,o{occr v ,ukld0, gatel)]] 



occr-var contains indices that represent Top variables used as occurrence identifiers 
(section p. 16 ). Indices of sorts that represent Top constants (e.g. foam, 5:00pml/l/91, 
Jun93 in figure 4.4) have their TVAR set to — . Indices of sorts that represent Top 
variables (e.g. flight_ent-var, minute-ent-var, other-temp-ent-var, occr^var) have their 
TVAR set to +. There is also a special sort now (not shown in figure |4.4|) that is used 
to represent the Top expression now* (section |3.13| ). 



The sorts of figure [4^ mirror the definitions of Top's operators. For example, the 
ntense sort reflects that fact that the first argument of an Ntense operator must be 
now* or a variable (tvar, +) denoting a period (temp_ent), while the second argument 
must be a yes/no formula (psoa). (The semjnum sort in for^op is a child-sort of 
non_temp-ent, with subsorts that represent the numbers 1, 2, 3, etc. The compLpartng 
and gappy _partng sorts in for^op and part are discussed below.) 

The hierarchy under ind is domain-dependent. For example, in an application where 
the database contains information about a company, the subsorts of non_temp_ent 
would correspond to departments, managers, etc. I assume, however, that in all appli- 
cation domains, ind would have the children-sorts temp-ent, non-temp_ent, occr-var, 
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gappy <_partng (to be discussed below), and possibly more. I also assume that the sub- 



sorts of partng (see below) and temp.ent would have the general form of figure 4.4 
though they would have to be adjusted to reflect the partitionings and temporal enti- 
ties used in the particular application. 



I now turn to the partng sort of figure 4.4, which has the subsorts compLpartng and 



gappyjpartng (these three sorts do not exist in | Pollard &: Sag 94j ). For each Top 
complete or gappy partitioning name (e.g. minute , day c , 5:00pm 9 , monday 9 ) there is 
a leaf-subsort of compLpartng or gappyjpartng respectively that represents that name. 
(The leaf-subsorts of gappyjpartng are also used to represent some Top terms; this 



is discussed below.) In figure 4.4 , the sorts 5:00pm, 9:00am, etc. are grouped under 
minute-gappy to reflect the fact that the corresponding partitionings contain minute- 
periods. (I assume here that these partitioning names denote the obvious partition- 
ings.) Similarly, monday, tuesday, etc. are grouped under day^gappy to reflect the 



fact that the corresponding partitionings contain day-periods. Section 4.12 provides 
examples where sorts like minute-gappy and day_gappy prove useful. 

Apart from gappy partitioning names, the subsorts of gappyjpartng are also used to 
represent Top terms that denote generic representatives of partitionings (section |3.18 ). 
(To allow the subsorts of gappyjpartng to represent TOP terms, gappyjpartng is not only 



a subsort of partng, but also of ind; see figure iA.) For example, ( [4.5|) (that expresses 
the habitual reading of ( |4.4| )) is represented as (|4.6[ ). In this case, the subsort 5:00pm 
of gappyjpartng represents the TOP constant 5:00pm. 

(4.4) BA737 departs (habitually) at 5:00pm 

(4.5) Pres[hab_departs-at(ba737 , 5:00pm)] 
(4.6) 



pres 



MAIN_PSOA 



hab_departs_at 



ARGl 



ARG2 



ba737 



5:00pm 



In contrast, ( |4.8| ) (that expresses the non-habitual reading of ( [4.7[ )) is represented as 
( [4.9| ). In this case, the subsort 5:00pm of gappyjpartng represents the Top gappy 
partitioning name 5:00pm 9 . (It cannot represent a Top term, because Top terms 
cannot be used as first arguments of Part operators.) The CDs in ( |4.9| ) mean that the 
values of part_var and time_spec must be token-identical, i.e. they must represent 
the same Top variable. 
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(4.7) 
(4.8) 
(4.9) 



BA737 departed (actually) at 5:00pm. 
Part[5:00pm 9 , fv v ] A At[fv°, Past[e v , depart(ba737)}} 



and 



conjunct! 

part 

CONJUNCT2 

at_op 



PARTNG 



PART.VAR 



5:00pm 



TVAR — 



minute_ent_var 
TIME_SPEC El 

ET_HANDLE 

MAIN_PSOA 



TVAR + 



m 



past 



temp_ent 



TVAR + 



MAIN_PSOA 



depart 



ARGl 



ba737 



TVAR 



The minute-gappy and gappy.var sorts of figure 4.4 are used only to represent Top vari 



ables that denote generic representatives of unknown minute-gappy or day_gappy par- 
titionings. The t v variable of ( |4.1l| ), for example, denotes the generic representative of 
an unknown minute-gappy partitioning. (If BA737 departs habitually at 5:00pm, t v de- 
notes the generic representative of the 5:00pm 9 partitioning, the same generic represen- 
tative that the 5:00pm constant of (|4.5|) denotes.) The Pres[hab-departs-at(ba737 , t v )] 
part of ( [4.11 ) is represented as ( 4.12| ). (The feature-structure representation of quan- 
tifiers will be discussed in section W. 



(4.10) When does BA737 (habitually) depart? 

(4.11) ?r Pres[hab_departs.at(ba737,t v )] 



(4.12) 



pres 



MAIN_PSOA 



hab_departs_at 



ARGl 



ARG2 



ba737 



minute^gappyjoar 



TVAR + 



The indices of sorts like minute-gappy-var and day_gappy_var have their TVAR set to 
+. The indices of all other leaf-subsorts of gappyjpartng (e.g. 5:00pm, monday) have 
their tvar set to — . 



4.5 Representing TOP quantifiers in HPSG 

Top yes/no formulae are represented in the Hpsg version of this thesis as feature- 
structures of sort psoa (figure |4.l[) . To represent Top wh-formulae (formulae with 
interrogative or interrogative-maximal quantifiers) additional feature-structure sorts 
are needed. I discuss these below. 
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Feature structures of sort quant represent unresolved quantifiers (quantifiers whose 
scope is not known yet). They have two features: det and restind (restricted index), 
as shown in ( 4.13] ), The det feature shows the type of the quantifier. In this thesis, det 
can have the values exists (existential quantifier), interrog (interrogative quantifier), 
and interrog _mxl (interrogative-maximal quantifier). (Apart from the values of DET, 
quant is as in ffPollard fc Sag 94| .) 



(4.13) 



quant 



DET 

RESTIND 
nom_obj 



exists V interrog V interrogjraxl 



INDEX 

ind 

RESTR set(psoa) 



TVAR + 



The values of restind are feature structures of sort nom-obj (nominal object). f] These 
have the features index (whose values are of sort ind) and RESTR (whose values are 
sets of psoas). When a nom-obj feature structure is the value of restind, the index 
corresponds to the Top variable being quantified, and the restr corresponds to the 
restriction of the quantifier. (If the restr set contains more than one psoas, the 
psoa-elements of the set are treated as forming a conjunction.) For example, (4.14) 



represents ( 4.15|) 



(4.14) 



DET 



RESTIND 



interrog 

INDEX 
RESTR 



nom_obj 



ind 



TVAR + 



m 



flight 



ARGl [D 



(4.15) 



quant 

V v flights 



Although Top does not use explicit existential quantifiers (universal quantification is 
not supported, and Top variables can be thought of as existentially quantified), the 
Hpsg version of this thesis employs explicit existential quantifiers (quants whose det is 
exists) for compatibility with [Pollard & Sag 94]. These explicit existential quantifiers 



are removed when extracting Top formulae from signs (this is discussed in section 4.6 
below) . 



5 In [Pollard & Sag 94 , nom,-obj has the subsorts npro (non-pronoun) and pron (pronoun). These 
subsorts are not used in this thesis. 
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4.6 Extracting TOP formulae from HPSG signs 



The parser maps each question to a sign. (Multiple signs are generated when the parser 
understands a question to be ambiguous.) For example, "Which inspector was at gate 
2?" is mapped to (|4.16| ) (exactly how ( 4.16 ) is generated will become clearer in the 



following sections; see also the comments about Ntenses in section 4 . 9 . l| below). 

phon {which, inspector, was, at, gate2) 



(4.16) 



SYNSEM LOC 



CAT 



HEAD 



VFORM fin 
AUX + 



verb L 
ASPECT lex_state 



SPR 

SUBJ 

COMPS 









CONT 



QSTORE 



quant 



past 
DET 

RESTIND 



ET_HANDLE 


TVAR 


+ 




temp_ent 










ARGl 


m 




MAIN_PSOA 










ARG2 


gate2 




located_at 









interrog 

INDEX 
RESTR 



nom_obj 



person_ent 



inspector 



TVAR + 
ARGl □ 



m 



Apart from the features that were discussed in section \j.2[ signs also have the feature 
QSTORE, whose values are sets of quants (section p~5| ). The CONT value of signs that 
correspond to questions is of sort psoa, i.e. it represents a Top yes/no formula. In 
the Hpsg version of this thesis, the QSTORE value represents quantifiers that must be 
"inserted" in front of the formula of CONT. In the prototype Nlitdb (to be discussed 
in chapter ||), there is an "extractor" of Top formulae that examines the CONT and 
QSTORE features of the question's sign, and generates the corresponding Top formula. 
This is a trivial process, which I discuss only at an abstract level: the extractor first 
examines recursively the features and feature values of CONT, rewriting them in term 
notation (in (4.16), this generates ( |4.17j )); then, for each element of QSTORE, the 
extractor adds a suitable quantifier in front of the formula of CONT (in ( [4.16| ), this 
transforms ( [4.17 ) into (4.18)). 

(4.17) Past[e v ,located.at(p v ,gate2)] 

(4.18) 7p v inspector (p v ) A Past[e v , located.at(p v , gate2)] 
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In the case of elements of QSTORE that correspond to existential quantifiers, no ex- 
plicit existential quantifier is added to the formula of CONT (only the expression that 
corresponds to the restr of the quant-element is added). For example, if the det of 
flO] ) were exists, ( gig ) would be ( gig ). 



(4.19) inspector {p") A Past[e v , located-at(p v , ^ate2)] 



The extracted formula then undergoes an additional post-processing phase (to be dis- 
cussed in section 4.17). This is a collection of transformations that need to be applied 
to some of the extracted formulae. (In ( 4.18Q and ( f4.19| ), the post-processing has no 
effect.) 



4.7 Verb forms 

I now present the treatment of the various linguistic constructs, starting from verb 
forms (simple present, past continuous, etc.). (Pollard and Sag do not discuss temporal 
linguistic mechanisms.) 

4.7.1 Single- word verb forms 



Let us first examine the lexical rules that generate signs for (single-word) non-base 
verb forms from signs for base forms. The signs for simple present forms are generated 



by (14201 ). 



(4.20) Simple Present Lexical Rule: 



PHON (A) 



SYNSEM LOC 



CAT 



HEAD 



VFORM bse 
AUX 



verb 

ASPECT lex_state 



CONT 



PHON (morph(\, simple_present)) 



SYNSEM LOC 



CAT 



CONT 



VFORM fin 
AUX 



verb L 
ASPECT lexstate 



pres 



MAIN_PSOA H] 
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( 4.20Q means that for each lexical sign that matches the first feature structure (the 
"left hand side", LHS) of the rule, a new lexical sign should be generated as shown 
in the second feature structure (the "right hand side", RHS) of the rule. (Following 
standard Hpsg notation, I write synsem|loc to refer to the LOC feature of the value of 
SYNSEM.) The heads of the LHS and RHS mean that the original sign must correspond 
to the base form of a non-auxiliary verb (auxiliary verbs are treated separately), and 
that the resulting sign corresponds to a finite verb form (a form that does not need to 
combine with an auxiliary verb). The CONT of the new sign is the same as the CONT 
of the original one, except that it contains an additional Pres operator. Features of 
the original sign not shown in the LHS (e.g. SUBJ, COMPS) have the same values in the 



generated sign. fl4.20|) requires the original sign to correspond to a (lexical) state base 
form. No simple present signs are generated for verbs whose base forms are not states. 



This is in accordance with the assumption of section 2.5.1 that the simple present can 
be used only with state verbs. 

morph(X, simple-present) denotes a morphological transformation that generates the 
simple present form (e.g. "contains") from the base form (e.g. "contain"). The proto- 
type Nlitdb actually employs two different simple present lexical rules. These generate 
signs for singular and plural simple present forms respectively. As mentioned in sections 



2.13 and L3, plurals are treated semantically as singulars, and no number-agreement 
checks are made. Hence, the two lexical rules differ only in the phon values of the 
generated signs. 

( 4.21] ) shows the base form sign of "to contain" in the airport domain. From ( 4.21| ), 
( 4.20Q generates (4.22). The tank_ent and mass-ent in ( 4.21| ) and ( 4.22j ) mean that the 
indices introduced by the subject and the object must be of sort tank-ent and mass-ent 



respectively (tank_ent is a sister of flight_ent in figure 4.4). Hence, the semantically 
anomalous "Gate 2 contains water." (where the subject introduces an index of sort 
gate2, which is not a subsort of tank_ent) would be rejected. All lexical signs of verb 
forms have their qstore set to {}. For simplicity, I do not show the QSTORE feature 
here. 
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(4.21) 



PHON (contain) 



SYNSEM LOG 



CAT 



VFORM bse 
AUX 



verb 

ASPECT lex_state 



SPR 
SUBJ 







(np 

COMPS <NP 



-PRD 



tanfc_ent|jj^ 
mass-ent 



CONT 



contains L 



ARGl □ 
ARG2 m 



(4.22) 



PHON (contains) 



SYNSEM LOC 



CAT 



VFORM fin 
AUX 



HEAD 

verb 

aspect lexstate 

SPR {) 
SUBJ (NP 



COMPS (NP 



-PRD 



tank_ent[l]^ 



CONT 
pres 



MAIN_PSOA 

contains L 



ARGl ED 
ARG2 m 



The simple past signs of culminating activity verbs are generated by ( 4.23| ), shown 
below. The simple past signs of non-culminating activity verbs are generated by a 
lexical rule that is similar to ( 4.23j ), except that it does not introduce a Culm operator 
in the resulting sign. 

The signs of past participles (e.g. "inspected" in "Who had inspected BA737?") are 
generated by two lexical rules which are similar to the simple past ones. There is a 
rule for culminating activity verbs (which introduces a Culm in the past participle 
sign), and a rule for non-culminating activity verbs (that introduces no Culm). Both 
rules do not introduce Past operators. The generated signs have their VFORM set to 
psp (past participle), and the same aspect as the base signs, i.e. their aspect is not 
changed to cnsqstate (consequent state). The shift to consequent state takes place 
when the auxiliary "had" combines with the past participle (this will be discussed in 



section |4.7.2[) . 



CHAPTER 4. FROM ENGLISH TO TOP 134 
(4.23) Simple Past Lexical Rule (Culminating Activity Base Form): 

PHON (A) 



SYNSEM LOC 



CAT 



VFORM bse 
AUX 



verb 

ASPECT culmact 



CONT U] 



PHON {morph(\, simplejpast)) 



SYNSEM LOC 



CAT 

CONT 
past 



HEAD 



VFORM fin 
AUX 



verb 

ASPECT culmact 



ET_HANDLE 



MAIN_PSOA 



temp_ent 



TVAR + 



culm 



MAIN_PSOA 



The signs for present participles (e.g. "servicing" in "Which company is servicing 
BA737?") are generated by ( 4.24| ). The present participle signs are the same as 
the base ones, except that their VFORM is prp (present participle), and their aspect 
is progressive (progressive state). 

(4.24) Present Participle Lexical Rule: 



PHON (A) 



SYNSEM | LOC 



CAT 



verb 

ASPECT aspect 



VFORM bse 
AUX 



CONT m 



PHON {morph(X, presentjparticiplej) 



SYNSEM LOC 



CAT 



VFORM prp 
AUX 



verb L 
ASPECT progressive 



CONT [U 



Gerund signs are generated by a lexical rule that is similar to ( |4.24| ), except that the 
generated signs retain the aspect of the original ones, and have their VFORM set to 
ger. In English, there is no morphological distinction between gerunds and present 
participles. Hpsg and most traditional grammars (e.g. [ Thomson fc Martinet 86| ), 



CHAPTER 4. FROM ENGLISH TO TOP 



135 



however, distinguish between the two. In ( [4.25 ), the "inspecting" is the gerund of "to 
inspect", while in ( 4.260 , the "inspecting" is the present participle. 



(4.25) J.Adams finished inspecting BA737. 

(4.26) J.Adams was inspecting BA737. 



The fact that gerund signs retain the aspect of the base signs is used in the treatment 
of "to finish" (section |2.6| ). The simple past "finished" receives multiple signs. (These 
are generated from corresponding base form signs by the simple past lexical rules.) 
(|4.27j ) is used when "finished" combines with a culminating activity verb phrase, and 



( 4.28| ) when it combines with a state or activity verb phrase. 



(4.27) 



phon (finished) 



SYNSEM LOC 



CAT 



HEAD 

verb 
ASPECT point 

SPR () 

SUBJ ([]]) 



VFOR.M fin 
AUX 



COMPS 



(VP 



CONT 



past 



ET_HANDLE 



SUBJ (0), VFORM ger, ASPECT culmact 
TVAR + 



m > 



temp_ent 



MAIN_PSOA 

end 



MAIN_PSOA 



culm 



MAIN_PSOA [U 



(4.28) 



PHON (finished) 



SYNSEM LOC 



CAT 



HEAD 



VFORM fin 
AUX 



verb L 
ASPECT point 

SPR {) 

SUBJ (CD) 



COMPS 



(VP 



CONT 
past 



ET_HANDLE 



MAIN_PSOA 



SUBJ ([]]}, VFORM ger, 
ASPECT state V activity 



i TVAR + 

temp_ent 

MAIN.PSOA H 

end - 



m) 



In (|4.27[) , the vp[subj <\T\>, vform ger, aspect culmact] :[! means that "finished" 
requires as its complement a gerund verb phrase (a gerund that has combined with 
its complements but not its subject) whose aspect is culminating activity. The [D of 
COMPS points to a description of the required subject of the gerund verb phrase, and 
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the [2] is a pointer to the CONT value of the sign of the gerund verb phrase. The two 



[0s in (1-27) have the effect that "finished" requires as its subject whatever the gerund 
verb phrase requires as its subject. The two H]s cause the sign of "finished" to inherit 
the CONT value of the sign of the gerund verb phrase, but with additional Past, End, 



and Culm operators. ( [4.28| ) is similar, but it introduces no Culm. 



In ( |4.25| ), the sign of the gerund "inspecting" retains the aspect of the base sign, 
which in the airport domain is culmact. The sign of the gerund verb phrase "in- 
specting BA737" inherits the culmact aspect of the gerund sign (following the aspect 
principle, to be discussed in section |4.11.1| ). Hence, (|4.27| ) is used. This causes ( 4.25 ) 



to receive a sign whose CONT represents ( [4.29D , which requires the inspection to have 
been completed. 

(4.29) Past[e v , End[Culm[inspecting(occr v , jadams, ba737)]]] 

In ( |4.3C| ), the sign of "circling" inherits the activity aspect of the base sign, causing 
( 4.2^ ) to be used. This leads to ( 4.31| ), which does not require any completion to have 



been reached. 

(4.30) BA737 finished circling. 

(4.31) Past[e v , End[cir cling (6a737)] 

There is also a sign of the simple past "hnished" for the case where the gerund verb 
phrase is a point. In that case, the CONT of the sign of "finished" is identical to the 
the CONT of the sign of the gerund verb phrase, i.e. the "finished" has no semantic 



contribution. This is in accordance with the arrangements of section [2^. The signs of 
"started", "stopped", and "began" are similar, except that they introduce Begin oper- 
ators instead of End ones. Unlike "finished", the signs of "stopped" do not introduce 
Culm operators when "stopped" combines with culminating activities, reflecting the 
fact that there is no need for a completion to have been reached. 

4.7.2 Auxiliary verbs and multi-word verb forms 

I now move on to auxiliary verbs and multi-word verb forms (e.g. "had departed", "is 
inspecting"). ( 4.32; ) shows the sign of the simple past auxiliary "had". According to 



(|31), "had" requires as its complement a past participle verb phrase. The 0s mean 
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that "had" requires as its subject whatever the past participle verb phrase requires as 
its subject. The [Us mean that the MAlN_PSOA value of the perf is the CONT value of 
the sign of the past participle verb phrase. 



(4.32) 



phon (had) 



SYNSEM LOC 



CAT 



CONT 



HEAD 



VFORM fin 
AUX + 



verb 

aspect cnsqstate 

SPR {) 
SUBJ <[]]) 



COMPS 



(VP 



past 



ET_HANDLE 



SUBJ {[H}, VFORM psp 
TVAR + 

temp_ent - 
ET_HANDLE 



m > 



MAIN_PSOA 

perf 



temp_ent 
MAIN_PSOA [U 



TVAR + 



In the airport domain, the past participle "departed" receives multiple signs (for various 
habitual and non-habitual uses; these signs are generated from the corresponding base 
form signs by the lexical rules of section 4.7.1| ). The sign of ( 4.34] ) is used in (4.33). 



(4.33) 
(4.34) 



BA737 had departed. 

PHON (departed) 



SYNSEM LOC 



CAT 



VFORM psp 
AUX 



CONT 



HEAD 

verb 
ASPECT point 

SPR {) 
SUBJ (NP -PRD 

COMPS () 

ARGl m 

actLdepart - 



f light-en 



According to ( |4.34j ), "departed" requires no complements, i.e. it counts as a verb 
phrase, and can be used as the complement of "had". When "had" combines with 
"departed", the SUBJ of (^3|) becomes the same as the subj of fl43l) (b ecause of 
the CDs in ( 4,32| )), and the main_psoa of the perf in ( 4.32j ) becomes the same as the 
CONT of ( PI (b ecause of the H]s in ( [4.32|) ). The resulting constituent "had departed" 
receives (Of]). 
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(4.35) 



phon {had, departed) 



SYNSEM LOC 



CAT 



HEAD 



VFORM fin 
AUX + 



verb 

ASPECT cnsqstate 
SPR {) 



CONT 



SUBJ (NP 
COMPS (} 

ET_HANDLE 



MAIN_PSOA 



-PRD 



f light_en 



past 



TVAR + 



temp_ent 

ET_HANDLE 

MAIN_PSOA 



perf 



temp_ent 



actLdepart 



TVAR + 
ARGl H 



The Hpsg principles (including the semantics and aspect principles that will be dis- 
cussed in sections |4.9.l| and |4.11.l| ) cause ( |4.35|) to inherit the head, aspect, SPR, 



SUBJ, and CONT values of (|4.32[) . Notice that this causes the aspect of "had departed" 
to become consequent state ("departed" was a point). As will be discussed in section 



4.9| , the proper name "BA737" contributes an index that represents the flight BA737. 
When "had departed" combines with its subject "BA737", the index of "BA737" be- 
comes the ARGl value of ( 4.35| ) (because of the Us of flOp). This causes to 
receive a sign whose CONT represents ( 4.36| ). 



(4.36) Past[el v , Perf[e2 v ,actLdepart(ba737)]] 



As mentioned in sections 2.5.4 and |3.15 , present perfect forms are treated semantically 
as simple past forms. This is why, unlike the sign of "had", the sign of "has" (shown 



in ( 4.37 )) does not introduce a Perf operator, and preserves the aspect of the past 
participle. This causes "BA737 has departed." to receive the same Top formula as 
"BA737 departed.". 
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(4.37) 



PHON (has) 



SYNSEM LOC 



CAT 



verb 
ASPECT □ 

SPR () 

SUBJ (H) 

COMPS < 



VFOR.M fin 
AUX + 



CONT 
past 



ET_HANDLE 

temp_ent 

MAIN_PSOA H] 



SUBJ {[U}, VFORM psp, ASPECT E 
TVAR + 



m > 



"Does" receives the sign of ( 1.38 ), which indicates that it requires as its complement 
a base verb phrase. The verb phrase must be a (lexical) state. (This is in accordance 
with the assumption of section |2.5.1 that the simple present can be used only with state 
verbs.) ( |4,38j ) and the (habitual) base sign of ( 4.39Q cause ( 4.4CQ to receive ( 4.41 ). 



(4.38) 



(4.39) 



(4.40) 



PHON (does) 



SYNSEM LOC 



CAT 



CONT 



PHON 



(service) 



SYNSEM LOC 



CAT 



VFORM fin 
AUX + 

verb 

ASPECT lex_state □ 
SPR () 

subj (m) 



COMPS 



(VP 



pres 



SUBJ {[U}, VFORM bse, ASPECT 
MAIN.PSOA E] 



:IU > 



HEAD 



VFORM bse 
AUX 



verb 

ASPECT lex_state 



SPR 
SUBJ 

COMPS 





(np 



ro in pa ri (/.en 



eE]) 



flight-en 



CONT 

hab_servicer_of 

Does Airserve service BA737? 



ARGl GO 

ARG2 no 
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(4.41) 



phon (Does, Airserve, service, BA737) 



SYNSEM LOC 



CAT 



HEAD 

verb L 
ASPECT 

SPR 

SUBJ 

COMPS 



VFORM 
AUX 

lex_state 







bse 
+ 



CONT 



MAIN_PSOA 



pres <- 



hab_servicer_of 



ARGl airserve 
ARG2 ba737 



In the airport domain, the base form of "to service" receives also a sign that corresponds 



to the non-habitual homonym. This is similar to (4.39), but it introduces the predicate 
functor actl servicing, and its aspect is culmact. This sign cannot be used in ( 4.40| ), 
because ( 4.38| ) requires the verb-phrase complement to be a state not a culminating 
activity. This correctly predicts that ( f4.40| ) cannot be asking if Airserve is actually 
servicing BA737 at the present moment. 

"Did" receives two signs: one for culminating-activity verb-phrase complements (shown 
in ( |4.42[ )), and one for state, activity, or point verb-phrase complements (this is similar 
to ( |4.42| ) , but introduces no Culm). In both cases, a Past operator is added. In the 
case of culminating-activity complements, a Culm operator is added as well. 



(4.42) 



PHON (did) 



SYNSEM LOC 



CAT 



VFORM fin 

HEAD 

AUX 

verb 

ASPECT culmact □ 
SPR () 
SUBJ (H) 



CONT 



COMPS (VP 



ET_HANDLE 



MAIN_PSOA 



past 



SUBJ {[U}, VFORM bse, ASPECT CD 
TVAR + 



® > 



temp_ent 



culm 



MAIN_PSOA [U 



The non-habitual sign of "service" and ( 4.42 ) cause (443) to be mapped to ( 4.44| ), 
which requires Airserve to have actually serviced BA737 in the past. The habitual sign 
of ( 4.39 ) and the "did" sign for non-culminating activity complements cause ( ^4,43j ) to 
be mapped to ( 4.45| ), which requires Airserve to have been a past habitual servicer of 
BA737. 
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(4.43) Did Airserve service BA737? 

(4.44) Past[e v , Culm[actl servicing (occr v , airserve, 6a737)]] 

(4.45) Past[e v , hab_servicer_of (airserve, 6a737)] 



The sign for the auxiliary "is" is shown in ( |4.46|) . The present participle "servicing" 
receives two signs, a non-habitual one (shown in ( 4.47] )) and a habitual one. The latter 
is similar to ( (4.47| ), but it introduces the functor habservicer_of , and its ASPECT 
is lexstate. (The two present participle signs are generated from the base ones by 
the present participle lexical rule of section |4.7.l[ ) (|4.46| ) and ( 4.47] ) cause ( 4.48j ) to 
be mapped to ( 4.49j ), which requires Airserve to be actually servicing BA737 at the 
present. ( 4.46| ) and the habitual present participle sign cause ( 4.48 ) to be mapped to 
(pL50l ), which requires Airserve to be the current habitual servicer of BA737. 



(4.46) 



(4.47) 



(4.48) 
(4.49) 
(4.50) 



PHON (is) 



SYNSEM LOC 



CAT 



CONT 



PHON (servicing) 



SYNSEM LOC 



CAT 



CONT 



HEAD 



VFORM fin 
AUX + 



verb 

ASPECT progressive 
SPR () 
SUBJ ([]]) 



COMPS 



(VP 



pres 



SUBJ {[]]}, VFORM 
MAIN_PSOA [U 



HEAD 



VFORM prp 
AUX 



verb 

ASPECT culmact 



SPR 
SUBJ 

COMPS 





(np 



ro rnpa n (/.en 



em) 



flight.en$2}) 

ARG 1 occr_var 
ARG2 CD 
ARG 3 m 



actLservicing 

Airserve is servicing BA737. 

Pres[actLser vicing (occr v , air serve, 6a737)] 

Pres [hab_servicer _of (air serve, 6a737)] 



m) 
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The sign for the auxiliary "was" is similar to ( 4.46j ), except that it introduces a Past 
operator instead of a Pres one. 



4.8 Predicative and non-predicative prepositions 



Following Pollard and Sag ( [pollard fc Sag 87| ], p. 65), prepositions receive separate 
signs for their predicative and non-predicative uses. In sentences like ( 4.51 ) and (|4.52j ), 
where the prepositions introduce complements of "to be", the prepositions are said to 
be predicative. In ( [4.53|) and ( 4.54 ), where they introduce complements of other verbs, 
the prepositions are non-predicative. 

(4.51) BA737 is at gate 2. 

(4.52) BA737 was on runway 3. 

(4.53) BA737 (habitually) arrives at gate 2. 

(4.54) BA737 landed on runway 3. 

Predicative prepositions introduce their own Top predicates, while non-predicative 
prepositions have no semantic contribution. 

4.8.1 Predicative prepositions 



( 4.55| ) shows the predicative sign of "at". (The predicative signs of other prepositions 
are similar.) The prd + shows that the sign is predicative, (prd is also used to 



distinguish predicative adjectives and nouns; this will be discussed in sections [4J} 
and |41Cl .) pform reflects the preposition to which the sign corresponds. Signs for 
prepositional phrases inherit the pform of the preposition's sign. This is useful in 
verbs that require prepositional phrases introduced by particular prepositions. 



(4.55) 



PHON 



(at) 



SYNSEM LOG 



CAT 



prep 



PFORM at 
PRD + 





(np 

COMPS (NP 



SPR 
SUBJ 



-PRD 



CONT 



located_at 



ARGl S 

ARG2 non_temp_en$2\ 
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According to (4.55), "at" requires a (non-predicative) noun-phrase ( "BA737" in ( [4.51] )) 
as its subject, and another one ("gate 2" in ( |4.51D ) as its complement. As will be 



discussed in section 4.9, "BA737" and "gate 2" contribute indices that represent the 
corresponding world entities. The [H of ( 4.55| ) denotes the index of "gate 2". (4.55) 
causes "at gate 2" to receive ( 4.56] ). 



(4.56) 



phon (at, gate2) 



SYNSEM LOC 



CAT 



CONT 



PFORM at 

HEAD 

PRD + 

prep 
SPR {) 

SUBJ (NP -PRD q-]) 
COMPS () 

ARGl Q] 



located_at 



ARG2 gate2 



Apart from ( |4.46| ) (which is used when "is" combines with a present-participle com- 



plement), "is" also receives ( 4.57 ) (which is used when "is" combines with predicative 
prepositional-phrases) . 



(4.57) 



PHON 



(is) 



SYNSEM LOC 



CAT 



VFORM fin 
AUX + 



HEAD 

verb 

aspect lex_state 

SPR {) 
SUBJ (H) 



COMPS 



SUBJ {[U), PRD + 



m ) 



CONT 



pries 



MAIN_PSOA [U 



According to ( [4.57 ), "is" requires as its complement a predicative prepositional phrase 
(a predicative preposition that has combined with its complements but not its subject), 
like the "at gate 2" of ( |456|) . flOfp and (pL57|) cause fl45lD to receive (JO|). 
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(4.58) 



phon (BA737, is, at, gate2) 



SYNSEM LOC 



CAT 



VFORM fin 



AUX 

verb L 
ASPECT lex_state 



+ 



SPR 

SUBJ 

COMPS 









CONT 
pres 



MAIN_PSOA 

located_at - 



ARGl 
ARG2 



ba737 
gate2 



Like "is", "was" receives two signs: one for present-participle complements (as in 
"BA737 was circling."), and one for predicative prepositional-phrase complements (as 
m 

(gjj)). These 

are similar to the signs of "was", but they introduce Past operators 
rather than Pres ones. 



4.8.2 Non-predicative prepositions 



The non-predicative sign of "at" is shown in fl£5S| ). (The non-predicative signs of 
other prepositions are similar.) The [I] is a pointer to the CONT value of the sign that 
corresponds to the noun-phrase complement of "at". Notice that in this case the "at" 
has no semantic contribution (the "at" sign simply copies the CONT of the noun-phrase 
complement). 



(4.59) 



PHON 



{at) 



SYNSEM LOC 



CAT 



HEAD 



prep 



PFORM at 
PRD 



SPR 
SUBJ 








COMPS (NP 



-PRD 



m) 



CONT H] 



( 4.59| ) and the habitual sign of "arrives" of ( 4.6C| ) cause (f4.53|) to receive ( |4.61 ), 
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(4.60) 



phon (arrives) 



SYNSEM LOC 



CAT 



VFORM fin 
AUX 



verb 

ASPECT lex_state 



SPR 
SUBJ 

COMPS 





(np 



-PRD 



flight.ent[Tp 
-PRD, PFORM at 



gate=ent\ 



m > 



CONT 
pres 



MAIN_PSOA 
hab_arrive_at L 



ARGl on 

ARG2 H 



(4.61) 



phon (BA737, arrives, at, gate2) 



SYNSEM LOC 



CAT 



VFORM fin 
AUX 



verb L 
aspect lexstate 

SPR {) 
SUBJ {} 
_COMPS {} 



CONT 
pres 



MAIN_PSOA 
hab_arrive_at L 



argI ba 737 
arg2 gate2 



The (predicative and non-predicative) prepositional signs of this section are not used 
when prepositions introduce temporal adverbials (e.g. "BA737 departed at 5:00pm."). 
There are additional prepositional signs for these cases (see section [1.11 below). 



4.9 Nouns 



Like prepositions, nouns receive different signs for their predicative and non-predicative 
uses. Nouns used in noun-phrase complements of "to be" (more precisely, the lexical 
heads of such noun-phrase complements), like the "president" of ( |4.62[) , are predicative. 
The corresponding noun phrases (e.g. "the president" of ( |4.62| )) are also said to be 
predicative. In all other cases (e.g. "the president" of (|4.63|) ), the nouns and noun 
phrases are non-predicative. 

(4.62) J.Adams is the president. 

(4.63) The president was at gate 2. 
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Let us first examine non-predicative nouns. (4.64) shows the sign of "president" that 
would be used in ( 4.63| ). The prd value shows that the sign corresponds to a non- 
predicative use of the noun. The SPR value means that the noun requires as its specifier 
a determiner (e.g. "a", "the"). 



(4.64) 



phon (president) 



SYNSEM LOC 



CAT 



CONT 



PRD 



LOC I CAT I HEAD det 



HEAD 

noun 

SPR ( 

SUBJ () 
_COMPS () 

index person_enCD 



nom_obj 



RESTR 



• ntense 



ET_HANDLE 

temp_ent 

MAIN_PSOA 

president 



TVAR + V now 
ARGl Q] 



QSTORE {} 



The CONT values of signs that correspond to non-predicative nouns are of sort nom-obj 
(section |4.5| ). The index value stands for the world entity described by the noun, and 
the restr value represents Top expressions that are introduced by the noun. 



( 4.65| ) shows the sign of "the" that is used in ( 4.63| ). (In this thesis, "the" is treated 
semantically ". This is of course an over-simplification.) 



(4.65) 



PHON 



(the) 



SYNSEM LOC 



CAT 



det 



SPEC LOC 



CAT 



HEAD 

noun 

spR 

SUBJ {) 
COMPS {) 



CONT m 



SPR 

SUBJ 

COMPS 

DET 









QSTORE 



CONT 
quant 



exists 



RESTIND 



TVAR + 







m 
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The SPEC feature of ( 4.65Q means that "the" must be used as the specifier of a non- 
predicative N, i.e. as the specifier of a non-predicative noun that has combined with 
its complements and that requires a specifier. The H]s of (4.65) cause an existential 
quantifier to be inserted into the quantifier store, and the H]s cause the restind of 
that quantifier to be unified with the CONT of the TV's sign. 



According to ( [4.64| ), "president" is non-predicative, it does not need to combine with 
any complements, and it requires a specifier. Hence, it satisfies the spec restrictions 



of ( 4.65 ), and "the" can be used as the specifier of "president". When "the" combines 



with "president", the restind of (|4.65Q is unified with the CONT of fl4.64Q (because 
of the [2]s in (|4.65| )), and the QSTORE of (|4.65| ) becomes (|4.66| ) (because of the Us in 
( 4. 65| ) ) . The resulting noun phrase receives ( [4.67| ). 



(4.66) 



RESTIND 



exists 

INDEX 



RESTR 



person_ent 



TVAR 



- ntense 



ET_HANDLE 

temp_ent 

MAIN_PSOA 

president 



TVAR + 



V now 



ARGl CD 



® 



CD 



(4.67) 



phon {the, president) 



SYNSEM LOC 



CAT 



HEAD 

noun 
SPR {) 

SUBJ {) 

COMPS () 



PRD 



CONT 



nom_obj 



person_ent 



TVAR + 



CD 



RESTR 



• ntense 



ET_HANDLE 

temp_ent 

MAIN_PSOA 

president 



TVAR + V now 
ARGl CD 



CH 



QSTORE 



quant L 



det exists 

RESTIND CD 



According to the head feature principle (section [4.2.3 ), ( |4.67| ) inherits the head of 
( 4. 64| ) (which is the sign of the "head daughter" in this case). The propagation of 
CONT and QSTORE is controlled by the semantics principle, which in this thesis has the 
simplified form of ( |4.68j ) . (( |4.68| ) uses the terminology of [ Pollard &: Sag 94| ]. I explain 
below what ( 4.68| ) means for readers not familiar with [ Pollard Sz Sag 94] |.) 
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(4.68) Semantics Principle (simplified version of this thesis): 

In a headed phrase, (a) the QSTORE value is the union of the QSTORE values 
of the daughters, and (b) the synsem|loc|cont value is token-identical with 
that of the semantic head. (In a headed phrase, the semantic head is the 
adjunct-daughter if any, and the head-daughter otherwise.) 



Part (a) means that the QSTORE of each (non-lexical) syntactic constituent is the union 
of the QSTOREs of its subconstituents. Part (b) means that each syntactic constituent 
inherits the CONT of its head-daughter (the noun in noun-phrases, the verb in verb 
phrases, the preposition in prepositional phrases), except for cases where the head- 
daughter combines with an adjunct-daughter (a modifier). In the latter case, the 
mother syntactic constituent inherits the CONT of the adjunct-daughter. (This will be 
discussed further in section 4.11.) Readers familiar with flPollard &; Sag 94 ] will have 
noticed that ( [4.68 ) does not allow quantifiers to be unstored from QSTORE. Apart from 
this, ( |0|| is the same as in |Pollard &: Sag 94| . 



fl4.68j) causes the QSTORE of (|4.67| ) to become the union of the QSTOREs of (|4.64[) (the 
empty set) and ( [4.65| ) (which has become ( 4.66[) ). Since "the president" involves no 



adjuncts, the "semantic head" is the "head-daughter" (i.e. "president"), and ( 4.67 ) 
inherits the CONT of (|4.64j) (which is now the restind of fl4.66|) ). 



The "gate 2" of ( p33j) is treated as a one-word proper name. (In the prototype Nlitdb, 
the user has to type "terminal 2" as a single word; the same is true for "J.Adams" 
of ( |4.62| ). This will be discussed in section |6.6.1| .) Proper names are mapped to signs 
whose CONT is a nom-obj with an empty-set restr.^ "Gate 2", for example, receives 



(Pi). 



(4.69) 



PHON 



(gateS) 



SYNSEM LOC 



CAT 



CONT 
nom_obj 



HEAD 

noun 

SPR {) 

SUB.J {) 

COMPS {) 

index gate2 

RESTR {} 



QSTORE {} 



In JPollard fc Sag 94| , the signs of proper names involve naming relations, and context and BACK- 
GROUND features. These are not used in this thesis. 
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The predicative sign of "at" of ( |4.55| ), the predicative sign of "was" (which is similar 
to ( 4.57 ), except that it introduces a Past), and ( f4.69| ) cause the "was at gate 2" of 
flOaD to receive (pL70D . 



(4.70) 



phon (was, at, gate2) 



SYNSEM LOC 



CAT 



VFORM fin 
AUX + 



verb 

aspect lex_state 

SPR {) 
SUBJ (NP -PR] > 

COMPS (} 



CONT 



past 



ET_HANDLE 

temp_ent 

MAIN_PSOA 

located_at 



TVAR + 
ARGl CD 

arg2 gate2 



QSTORE {} 



When "was at gate 2" combines with "the president", (4.63) receives (4.71). According 



to the semantics principle, the QSTORE of ( |4.71[ ) is the union of the QSTOR.es of ( |4.70| ) 
and ( |4.67D , and the CONT of ( |4.71[ ) is the same as the CONT of ( [4.70| ). 



(4.71) 



PHON 



(the, president, was, at, gate2) 



SYNSEM LOC 



CAT 



VFORM fin 
AUX + 



HEAD 

verb 

aspect lexstate 

SPR {) 
SUBJ {) 
COMPS {) 



CONT 



past 



QSTORE 



DET 



ETJJANDLE 

temp_ent 

MAIN_PSOA 

located_at 

exists 
INDEX 



TVAR + 
ARGl [0 

arg2 gate2 



RESTIND 



nom_obj 



person_ent 



TVAR + 



m 



R.ESTR 



• ntense 



ETJJANDLE 

temp_ent 

MAIN_PSOA 

president 



TVAR + V now 
ARGl CD 



(|4.72| ) is then extracted from (|4.71| ), as discussed in section 4^. Whenever an Ntense 
operator is encountered during the extraction of the Top formulae, if there is no definite 
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information showing that the first argument of the Ntense should be now*, the first 



argument is taken to be a variable. (4.71), for example, shows that the first argument 
of the Ntense could be either a Top variable or now*. Hence, in ( |4,72| ) the first 
argument of the Ntense has become a variable (t v ). During the post-processing phase 



(section 4.17 below), the Ntense of ( 4.72| ) would give rise to two separate formulae: one 



where the first argument of the Ntense has been replaced by now* (current president), 
and one where the first argument of the Ntense has been replaced by the e v of the 
Past operator (president when at gate 2). In contrast, if the sign shows that the first 
argument of the Ntense is definitely now*, the first argument of the Ntense in the 
extracted formula is now* , and the post-processing has no effect on this argument. 

(4.72) Ntense[t v ,president(p v )\ A Past[e v ,located.at(p v , gate2)] 

It is possible to force a (non-predicative) noun to be interpreted as referring always 
to the speech time, or always to the time of the verb tense. (This also applies to the 



non-predicative adjectives of section 4.1C below.) To force a noun to refer always to the 



speech time, one sets the et_handle of the ntense in the noun's sign to simply now 



(instead of allowing it to be either now or a variable-representing index as in (4.64)). 



This way, the et_handle of the ntense in ( [4.71| ) would be now. ( 4.72| ) would contain 



now* instead of f (because in this case the sign shows that the first argument of the 
Ntense should definitely be now*), and the post-processing mechanism would have no 
effect. 

To force a noun to refer always to the time of the verb tense, one simply omits the 
Ntense from the noun's sign. This would cause the formula extracted from the sign of 

flOap to be flETSp . 

(4.73) president(p v ) A Past[e v , located_at(p" , gate2)] 

The semantics of Top's conjunction (section |3.6|) and of the Past operator (section 



3.8|) require president(p v ) and located _at(p v ', gate2) to be true at the same (past) event 
time. Hence, (|4.73| ) expresses the reading where the person at gate 2 was the president 
of that time. 



There are however, two complications when (non-predicative) noun signs do not in- 
troduce Menses. (These also apply to adjective signs, to be discussed in section 
4.10| .) First, a past perfect sentence like (474) receives only ( 4.75 ), which requires 
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president (p v ) to be true at the event time pointed to by eV (the "reference time", 
which is required to fall within 1/1/95). That is, "the president" is taken to refer to 
somebody who was the president on 1/1/95, and who may not have been the president 
during the visit. 

(4.74) The president had visited Rome on 1/1/95. 

(4.75) presidential") A At[l/l/95 , Past[el v , Perf[e2 v , visiting(p v , rome)]]] 

In contrast, if the sign of "president" introduces an Ntense, the formula extracted from 
the sign of (gjj) is fll76|) . The post -processing generates three different formulae from 



(|4.76| ). These correspond to readings where "president" refers to the time of the visit 
(t v replaced by e2 v ), the reference time (t v replaced by el v , equivalent to ( 4.75| )), or 
the speech time (t v replaced by now*). 

(4.76) Ntense[t v ,president(p v )]A 

At[l/l/95, Past [el v , Perf [e2 v , visiting(p v , rome)}]} 

The second complication is that (non-predicative) nouns that do not introduce Ntenses 
are taken to refer to the time of the main clause's tense, even if the nouns appear 
in subordinate clauses (subordinate clauses will be discussed in section [4.14 ). For 



example, if "president" does not introduce an Ntense, ( 4.77 ) is mapped to ( 4.78 ). The 
semantics of ( |4.78| ) requires the visitor to have been president during the building 
of terminal 2 (the visitor is not required to have been president during the visit to 
terminal 3). 

(4.77) Housecorp built terminal 2 before the president visited terminal 3. 

(4.78) president(p v ) A Before[Past[el v ,visiting(p v ,term3)], 

Past [e2 v , Culm[building(housecorp, term2)]]] 



In contrast, if "president" introduces an Ntense, the post-processing (section 4.17 
below) generates three readings, where "president" refers to the speech time, the time 
of the building, or the time of the visit. 

The non-predicative signs of nouns like "day" or "summer", that refer to members of 
partitionings (section |3.4j ) are similar to the non-predicative signs of "ordinary" nouns 
like "president", except that they introduce Part operators, and they do not introduce 
Ntenses. (|4.79| ), for example, shows the non-predicative sign of "day". (The day and 



day_ent_var sorts are as in figure [4.4] on page |12q . 
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(4.79) 



PHON 



(day) 



SYNSEM LOC 



CAT 



HEAD 

noun 



PRD 



SPR 

SUBJ () 
COMPS () 



LOC I CAT I HEAD det 



CONT 



nom_obj 



RESTR 



_ part 



_ent_var\±] 

PARTNG day 
PART_VAR Q] 



QSTORE {} 



Names of months and days (e.g. "Monday", "January") that can be used both with and 
without determiners (e.g. "on a Monday", "on Monday") receive two non-predicative 
signs each: one that requires a determiner, and one that does not. Finally, proper 
names that refer to particular time-periods (e.g. the year-name "1991", the date 
"25/10/95") receive non-predicative signs that are similar to those of "normal" proper 
names (e.g. "gate 2"), except that their index values are subsorts of temp-ent rather 
than non_temp_ent. I demonstrate in following sections how the signs of temporal 
nouns and proper names (e.g. "day", "25/10/95") are used to form the signs of tem- 
poral adverbials (e.g. "for two days", "before 25/10/95"). 

4.9.2 Predicative nouns 



I now turn to predicative nouns, like the "president" of (|46^ ). (E~80D shows the 
predicative sign of "president". Unlike non-predicative noun-signs, whose CONT values 
are of sort norri-obj, the CONT values of predicative noun-signs are of sort psoa. The 



president in ( 4.80 ) is a subsort of psoa. 



(4.80) 



PHON (president) 



SYNSEM LOC 



CAT 



CONT 



HEAD 

noun 



LOC | CAT | HEAD det 

D }m> 



SPR 

SUBJ (NP 
COMPS () 

ARGl person_en^\ 

president - 



QSTORE {} 



Unlike non-predicative nouns that do not require subjects (e.g. fl4.64|) ), predicative 



nouns do require subjects. In (|4.80[) , "president" requires a non-predicative noun 
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phrase as its subject. The [Q denotes the index of that noun phrase. 



In the Hpsg version of this thesis, the predicative signs of nouns are generated auto- 
matically from the non-predicative ones by ( |4 . 8 1| ) .p] 

(4.81) Predicative Nouns Lexical Rule: 



SYNSEM LOC 



CAT 



CONT 

nom_obj 
4 



HEAD 

noun 
SUBJ () 

INDEX 

RESTR jmj 



SYNSEM LOC 



CAT 



HEAD 
noun 

SUBJ (NP 



PRD + 



-PRD 



CONT remove_ntensel$2\) 



The remove-ntense(\2\) in ( ^4 . 8 1| ) means that if [2] (the single element of the RESTR set 
of the non-predicative sign) is of sort ntense, then the CONT of the predicative sign 
should be the MAlN_PSOA of LH (see also (4.64)). Otherwise, the CONT of the predicative 
sign should be LH. In other words, if the non-predicative sign introduces an Ntense, the 
Ntense is removed in the predicative sign. This is related to the observation in section 



2.11, that noun phrases that are complements of "to be" always refer to the time of the 



verb tense. For example, ( 4.82| ) means that J.Adams was the president in 1992, not at 
the speech time. (fh82) is represented correctly by ( [4.83 ) which contains no Ntenses. 



(4.82) J.Adams was the president in 1992. 

(4.83) ^t[1992, Past[e v , president^ .adams)}] 



Top predicates introduced by predicative nouns (e.g. president(j _adams) in (f4.83j )) 
end up within the operators of the tenses of "to be" (e.g. the Past of ( [4.83D ). This 
requires the predicates to hold at the times of the tenses. 



As with previous lexical rules, features not shown in ( 4.81| ) (e.g. SPR, COMPs) have the 
same values in both the original and the generated signs. For example, ( |4.81| ) generates 
flPcD from ( fOg ). 



7 Apart from the removejntense, (4.81) is e ssentially the same as Borsley's "predicative NP lexical 
rule", discussed in the footnote of p. 360 of [ Pollard fc Sag 94 1. 
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In this thesis, determiners also receive different signs for their uses in predicative and 
non-predicative noun phrases. (Pollard and Sag do not provide much information on 
determiners of predicative noun phrases. The footnote of p. 360 of [Pollard & Sag 94], 
however, seems to acknowledge that determiners of predicative noun phrases have to 
be treated differently from determiners of non-predicative noun phrases.) For example, 
apart from ( |4.65[) , "the" is also given (|4.84|) . The spec of ( [4.841) shows that ( |4.84[) can 
only be used with predicative nouns (cf. (|4.65 )). Unlike determiners of non-predicative 
noun phrases, determiners of predicative noun-phrases have no semantic contribution 
(the synsem|loc|cont of ( |4.84|) is simply a copy of the CONT of the noun, and no 
quantifier is introduced in qstore; cf. ( 4.65| )). 



(4.84) 



PHON 



(the) 



SYNSEM LOC 



QSTORE {} 



CAT 



HEAD 



SPEC LOC 



det 
SPR () 

SUBJ () 

COMPS () 



CONT m 



CAT 



HEAD 

noun 

SPR 



PRD + 



<-) 

SUBJ {_) 
COMPS {) 



CONT 



In ( |4.62|) , when "the" combines with "president", the resulting noun phrase receives 
( 4.85| ). (Hpsg's principles, including the semantics principle of ( 4.68] ), cause ( [4.85 ) to 
inherit the head, SUBJ, and CONT of (|4.80| ) .) 



(4.85) 



PHON {the, president) 



SYNSEM LOC 



QSTORE {} 



CAT 



CONT 



HEAD 

noun 
SPR () 



PRD + 



SUBJ (NP 
COMPS () 

ARGl person^en^T} 

president - 



Apart from ( |4.46 ) and ( 4.57 ), "is" also receives ( |4.86| ), which allows the complement 
of "is" to be a predicative noun phrase. (There is also a sign of "is" for adjectival 
complements, as in "Runway 2 is closed."; this will be discussed in section 4.10| . "Was" 
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receives similar signs.) The Us in ( 4.86| ) denote the CONT of the predicative noun- 
phrase. 



(4.86) 



PHON (is) 



SYNSEM LOC 



CAT 



HEAD 



VFORM fin 
AUX + 



verb L 
ASPECT lex_state 



SPR 
SUBJ 







COMPS (NP 



SUBJ {H}, PRD + 



CONT 
pres 



MAIN_PSOA H 



QSTORE {} 



( 4.86j ) and ( 4.85] ) cause the "is the president" of fl4.62| ) to receive ( [4.87| ). Finally, when 
"is the president" combines with "J.Adams", ( [4.62| ) receives a sign with an empty 
QSTORE, whose CONT represents ( [4.881) . 



(4.87) 



PHON (is, the, president) 



SYNSEM LOC 



CAT 



HEAD 



VFORM fin 
AUX + 



verb 

aspect lexstate 





SPR 
SUBJ (NP 
COMPS (} 



-PRD 



a) 



CONT 
pres 



MAIN_PSOA 

president 



ARGl person_en1[t} 



QSTORE {} 

(4.88) Pres\president(j-adams)] 



There are currently two complications with predicative noun phrases. The first is that 
in the non-predicative signs of proper names like "gate2", the value of restr is the 
empty set (see ( [4. 69 )). Hence, ( 4.81 ) does not generate the corresponding predicative 
signs, because the non-predicative signs do not match the restr description of the 
LHS of (Eh8lD (which requires the restr value to be a one-element set). This causes 
( 4.89| ) to be rejected, because there is no predicative sign for "J. Adams" . 



(4.89) The inspector is J.Adams. 



One way to solve this problem is to employ the additional rule of fl4.90|) . 
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SYNSEM LOC 



CAT 



CONT 
nom_obj 



HEAD PRD 

noun 
SUBJ {) 

INDEX 
RESTR {} 



SYNSEM LOC 



CAT 



CONT 



HEAD PRD + 
noun 

SUBJ (NP -PCD 



m> 



identity 



ARGl □ 
ARC 2 m 



This would generate ( |4.91| ) from the non-predicative sign of "J.Adams" (which is similar 
to (|469|) ). ( ^86|) and (^91p would cause (JL8|) to be mapped to (]492|) . (I assume 
here that the non-predicative "inspector" does not introduce an Ntense.) 



(4.91) 



(4.92) 



PHON {J.Adams) 



SYNSEM LOC 



CAT 



CONT 



HEAD 

noun 
SPR {) 

SUBJ (NP 

COMPS () 



-PRD 



identity 



ARGl j_adams 
ARG2 H 



QSTORE {} 

inspector (insp v ) A Pres[identity(j_adams,insp v )] 



identity (ti,T2) is intended to be true at event times where its two arguments denote the 
same entity. This calls for a special domain-independent semantics for identity (n, T2). 
I have not explored this issue any further, however, and ( [4.90 ) is not used in the 
prototype Nlitdb. 

A second complication is that the non-predicative sign of "Monday" (which is similar 
to ( 4.79D ) and the treatment of predicative noun phrases above lead to an attempt to 
map (|493|) to (|494| ). 



(4.93) 23/10/95 was a Monday. 
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( 4.94| ) is problematic for two reasons, (a) The past tense of ( gjgg ) is in effect ignored, 
because the denotation of Part[a, 0\ does not depend on It, which is what the Past op- 
erator affects (see sections |3.6| and 3.8). Hence, the implication of Q4.93 ) that 23/10/95 
is a past day is not captured. This problem could be solved by adding the constraint 
g{/3) E It in the semantics of Part [a, 0\ (section [3.6] ). (b) ( |4.94| ) violates the syntax of 
Top (section |3.2j ), which does not allow the second argument of a Part operator to 
be a constant. This problem could be solved by modifying Top to allow the second 
argument of Part to be a constant. 



4.10 Adjectives 



Following Pollard and Sag ([ pollard fc Sag 87 ], pp. 64 - 65), adjectives also receive 
different signs for their predicative and non-predicative uses. When used as comple- 
ments of "to be" (e.g. "closed" in ( 4.95| )) adjectives are said to be predicative. In all 
other cases (e.g. "closed" in ( 4.96] )), adjectives are non-predicative. (( [4.95 ) is actually 
ambiguous. The "closed" may be a predicative adjective, or the passive form of "to 
close". As noted in section 2.13| , however, passives are ignored in this thesis. Hence, I 
ignore the passive reading of ( 4.95j ).) 



(4.95) Runway 2 was closed. 

(4.96) BA737 landed on a closed runway. 



In the airport domain, the predicative sign of the adjective "closed" is (4.97). 



(4.97) 



phon (closed) 



SYNSEM LOC 



QSTORE {} 



CAT 



CONT 



adj 
SPR () 



PRD + 



-PRD 



SUBJ (NP 
COMPS () 

ARGl (gate_ent V runway^ent^H 

closed - 



As noted in section 4.G , "is" and "was" receive four signs each. One for progressive 
forms (see ( |4.46f )), one for prepositional phrase complements (see ( 4.57[) ), one for noun- 



CHAPTER 4. FROM ENGLISH TO TOP 



158 



phrase complements (see Q4.86D ), and one for adjectival complements (( |4.98[ ) below). 
(g9| ) and flP^D cause ( ^95| ) to be mapped to ( pL99l ). 



(4.98) 



PHON 



(was) 



SYNSEM LOC 



CAT 



VFORM fin 
AUX + 



HEAD 

verb 

aspect lex_state 

SPR {) 
SUBJ {H) 



COMPS 



LOC 



CAT 



PRD + 



adj 

SUBJ (H) 
COMPS {} 



CONT 
past 



ET_HANDLE 

temp_ent 

MAIN.PSOA LU 



CONT m 
TVAR + 



(4.99) 



QSTORE {} 

Past[e v , closed{runway2)\ 



Q4.100D shows the non-predicative sign of "closed". The "closed" in ( |4,96| ) is a modifier 
(adjunct) of "runway". The mod in (|4.100[) refers to the synsem of the sign of the 
noun that the adjective modifies. The synsem | LOC | CONT of ([4. 100|) is the same as the 
one of the noun-sign, except that an Ntense is added to the RESTR of the noun-sign 
(i.e. to the set denoted by H]). The additional Ntense requires the entity described by 
the noun (the entity represented by CD) to be closed at some unspecified time. The 
index of the noun's sign is also required to represent a gate or runway. 
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(4.100) 



phon (closed) 



SYNSEM LOC 



CAT 



HEAD 



PRD 



MOD LOC 



CAT 



CONT 
nom_obj 



HEAD noun 



SPR. {_) 

COMPS () 

INDEX H] 

RESTR [U 



adj 
SPR {) 
SUBJ {) 
COMPS {) 

index (gate_ent V runway_entfi±\ 



CONT 



nom_obj L 



RESTR [U U 



. ntense 



ET_HANDLE 

temp_ent 

MAIN_PSOA 

closed 



V now 

ARGl Q] 



QSTORE {} 



In the airport domain, the non-predicative sign of "runway" is ( 4.101|) . (I assume that 
"runway" does not introduce an Ntense.) 

PHON (runway) 



(4.101) 



SYNSEM LOC 



CAT 



HEAD 

noun 

SPR 



PRD 

LOC I CAT I HEAD det 



CONT 
nom_obj 



SUBJ () 
COMPS () 

index runway_entH 

RL-STl! \ ARGl 

runway - 



QSTORE {} 



In ( 4.96 ), "closed" combines with "runway" according to Hpsg's head-adjunct schema 
(see flPollard fc Sag 94f ). "Closed runway" receives the sign of ([4.102D , where H] is the 
set of ( |4. 1Q3| ) . (Sets of psoas are treated as conjunctions.) 
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(4.102) 



phon (closed, runway) 



SYNSEM LOC 



CAT 



HEAD 

noun 

SPR. 



PRD 



CONT 
nom_obj L 



LOC | CAT | HEAD det 

SUBJ () 
COMPS () 

index runway_en§±\ 

RESTR [U 



QSTORE {} 



(4.103) 



runway 



ARGl [D 



ntense 











ET_HANDLE 


TVAR + 


V now 


1 


temp_ent 








MAIN_PSOA 


ARGl 03 






closed 









The principles of Hpsg cause Q4.102D to inherit the HEAD and SPR of ( |4.101| ). (|4.102j ) 
inherits the CONT of ( 4.100| ) according to the semantics principle of ( [4.68| ) (in this case, 
the "semantic head" is the adjunct "closed"). ( 4.102; ), the sign of "landed" (which is 



the same as (44), except that it also introduces Past and Culm operators), and the 
non-predicative sign of "on" (which is similar to fl4.59|) ), cause (|4.96| ) to be mapped to 
the ( 4.104 ). During the post-processing (section gj]] below), (|4.104|) gives rise to two 
different formulae, one where t v is replaced by now* (currently closed runway), and 
one where f is replaced by e v (closed during the landing). 

(4.104) runway(r v ) A Ntense[t v , closed(r v )] A 
Past[e v , Culm[landing-on{occr v , 6o737, r v )]\ 

An additional sign is needed for each non-predicative adjective to allow sentences like 
( [4.105 ), where a non-predicative adjective ( "closed") combines with a predicative noun 
( "runway"). 

(4.105) Runway 2 is a closed runway. 



( 4.100 ) cannot be used in ([4.105 ), because here "runway" is predicative, and hence the 
CONT of its sign is a psoa (the predicative sign of "runway" is similar to ( 4.80[) ). In 
contrast, ( 4.100| ) assumes that the CONT of the noun is a norri-obj. One has to use the 
additional sign of (pXj6|) .Pl Using (jQjp , ( ELlOfiD is mapped to (|4l07|) , which requires 
runway 2 to be closed at the speech time. 



It is unclear how ( [t.l06| ) could be written in the Hpsg version of | Pollard fc Sag 94 1. In 
Pollard fc Sag 9^ , ;he a nd sor t doe s not exist, and conjunctions of psoas can only be expressed 



using sets of psoas, as in (4.103). In (4.106), however, the value of SYSNSEm| LOC | CONT cannot be a 
set of psoas, because CONT accepts only values whose sort is psoa, noni-obj, or quant. 
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(4.106) 



SYNSEM LOC 



CAT 



PRD — 



MOD LOC 



CONT 

and L 



adj 
SPR () 
SUBJ () 
COMPS () 

CONJUNCT! 

CONJUNCT2 



CAT 



HEAD 

noun 
SPR {_) 

SUBJ (NP 

COMPS {} 



PRD + 



CONT m 



closed 



ARGl □ 



QSTORE {} 



(4.107) Pres[runway(runway2) A closed(runway2)] 



As discussed in section ^8, temporal adjectives (e.g. "former", "annual") are not con- 
sidered in this thesis. The prototype Nlitdb allows only non-predicative uses of the 



temporal adjective "current" (as in ( 4.108; )), by mapping "current" to a sign that sets 
the first argument of the noun's Ntense to now*. (This does not allow "current" to be 
used with nouns that do not introduce Ntenses; see section 4.9.1| .) 



(4.108) The current president was at terminal 2. 



4.11 Temporal adverbials 



I now discuss temporal adverbials, starting from punctual adverbials (section |2.9.1| ). 



4.11.1 Punctual adverbials 



Apart from ( [4.55[ ) and ( [4.59D (which are used in sentences like "BA737 is at gate 2." 
and "BA737 (habitually) arrives at gate 2."), "at" also receives signs that are used 
when it introduces punctual adverbials, as in fl4109D . (plOD shows one of these signs. 



(4.109) Tank 2 was empty at 5:00pm. 
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(4.110) 



PHON (at) 



SYNSEM LOG 



CAT 



prep 



PRD — 
MOD S 



VFORM fin 



:[U V VP 



VFORM psp 



m 



MOD | LOC | CAT | ASPECT state V activity 
V point 



CONT 
at_op 



SPR () 
SUBJ () 

COMPS (NP 

ASPECT point 

TIME_SPEC [3 
MAIN_PSOA [U 



iinute_ent[Xl^ 



QSTORE {} 



The mod feature refers to the synsem of the sign of the constituent modified by "at". 
s[vform /m]:[2] is an abbreviation for a finite sentence (a finite verb form that has 
combined with its subject and complements). The [2] refers to the CONT of the sign of 
the finite sentence. Similarly, vp[vform psp] :H] stands for a past participle verb phrase 
(a past participle that has combined with its complements but not its subject). The 
mod of ( 4, HOD means that ( 4.110 ) can be used when "at" modifies finite sentences 
or past participle verb phrases, whose aspect is state, activity, or point. Generally, 
in this thesis temporal adverbials (punctual adverbials, period adverbials, duration 



adverbials) and temporal subordinate clauses (to be discussed in section |4.14|) are 
allowed to modify only finite sentences and past participle verb phrases. 



( 4. HOD and the sign of "5:00pm" (shown in ( 4.111 )) cause "at 5:00pm" to receive 
( 4.112|) ("5:00pm" acts as the noun-phrase complement of "at"). 



(4.111) 



PHON (5:00pm) 



SYNSEM LOC 



CAT 



HEAD 

noun 

SPR 

SUBJ () 
COMPS () 

index minute_ent_vai\±\ 



CONT 



nom_obj 



RESTR 



_ part 



PARTNG 5:00pm 

PART_VAR Q] 



® 



QSTORE 



det exists 

RESTIND [U 



CHAPTER 4. FROM ENGLISH TO TOP 



163 



(4.112) 



PHON 



(at, 5:00pm) 



SYNSEM LOG 



CAT 



HEAD 



PRD 
MOD 



VFORM fin 



:H V VP 



VFORM psp 



m 



QSTORE 



prep 
SPR () 
SUBJ (} 
COMPS (} 
ASPECT point 

TIME_SPEC [3 

CONT 

MAIN_PSOA [U 

det exists 

index minute_ent_varH} 



MOD | LOC I CAT | ASPECT state V activity 
V point 



RESTIND 



RESTR 



part L 



PARTNG 5:00pm 
PART_VAR [3 



According to Hpsg's head feature principle (section [4.2.3 ), ( 4.112| ) inherits the head of 
(|4.110|) ( "at" is the "head-daughter" of "at 5:00pm", and "5:00pm" is the "complement- 
daughter"). Following the semantics principle of ( 4.68| ), the QSTORE of ( 4,112| ) is the 
union of the QSTOREs of (|4.110|) and (|4.111|) , and the CONT of (|4.112|) is the same as 
the CONT of ( |4. 110| ) (in this case, the "semantic head" is the head-daughter, i.e. "at"). 



The propagation of aspect is controlled by ( [4.113 ), a new principle of this thesis. (As 
with ([OsD , m (|4Tl3| ) I use the terminology of flPollard &: Sag 94| .) 



(4.113) Aspect Principle: 

In a headed-phrase, the synsem|loc|cat| aspect value is token-identical 
with that of the semantic head. (In a headed phrase, the semantic head 
is the adjunct-daughter if any, and the head-daughter otherwise.) 



( [4.113 ) means that each syntactic constituent inherits the aspect of its head-daughter 
(the noun in noun phrases, the verb in verb phrases, the preposition in prepositional 
phrases), except for cases where the head-daughter combines with an adjunct-daughter 
(a modifier). In the latter case, the mother syntactic constituent inherits the CONT 



of the adjunct-daughter. (4.113) causes (4.112) to inherit the aspect value of the 
semantic head "at". 



The "tank 2 was empty" of (47lQ9|) receives (4.114). 
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(4.114) 



phon {tank2, was, empty) 



SYNSEM LOC 



CAT 



VFORM fin 



AUX 



HEAD 

verb 

aspect lex_state 

SPR {) 
SUBJ {} 
COMPS {} 



+ 



CONT 



past 



ET_HANDLE 

temp_ent 

MAIN_PSOA 

empty 



TVAR + 
ARGl tank2 



QSTORE {} 



When "tank 2 was empty" combines with "at 5:00pm", ( 4.109| ) receives ( |4.115| ). In this 
case, "tank 2 was empty" is the head-daughter, and "at 5:00pm" is an adjunct-daughter 
(a modifier). Hence, according to ( |4.113| ), ( |4.115| ) inherits the aspect of ( f4.112| ) (i.e. 



point; in contrast, the aspect of (4.114) was lexstate.) This is in accordance with the 



arrangements of section 2.9.1, whereby punctual adverbials trigger an aspectual shift 
to point. 



(4.115) 



phon {tank2, was, empty, at, 5:00pm) 



SYNSEM LOC 



CAT 



HEAD 

verb L 
ASPECT point 

SPR {) 

SUBJ {) 

COMPS {} 

TIME_SPEC E 
MAIN_PSOA 



VFORM fin 
AUX + 



CONT 



at_op 



past 



ET_HANDLE 

temp_ent 

MAIN_PSOA 

empty 



ARGl 



+ 
tank2 



QSTORE < 



exists 

index minute_ent_varH} 



RESTIND 



RESTR 



part 



partng 5:00pm 
PART_VAR 



According to the semantics principle, ( [4,115| ) also inherits the CONT of ft4.112| ) (the 
sign of the modifier), and the QSTORE of ( |4.115| ) is the union of the QSTOREs of ([4.112| ) 
and ( 4.114 ). Finally, according to the head feature principle (section 4.2.3| ), ( 4.115| ) 
inherits the HEAD of ( |4.114|) (the sign of the head-daughter). The QSTORE and CONT 
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The reader may wonder why temporal adverbials (e.g. "at 5:00pm" in ( 4. 109|) ) are 
taken to modify whole finite sentences ( "tank 2 was empty") , rather than finite verb 
phrases ("was empty"). The latter approach leads to problems in questions like "Was 
tank 2 empty at 5:00pm?", where "was" combines in one step with both its subject 
"tank 2" and its complement "empty", following the head-subject-complement schema 



of [Pollard & Sag 94]. In this case, there is no verb phrase constituent (verb that has 
combined with its complements but not its subject) to be modified by "at 5:00pm". 

Apart from finite sentences, temporal adverbials are also allowed to modify past par- 
ticiple verb phrases (see the MOD of ( 4.110| )). This is needed in past perfect sentences 



like d43T7|) . 

(4.117) BA737 had entered sector 2 at 5:00pm. 



As discussed in section 2.5.5 , ( 4.117 ) has two readings: one where the entrance occurs 
at 5:00pm, and one where 5:00pm is a "reference time", a time where the entrance has 
already occurred. The two readings are expressed by ( 4.119j ) and ( 4.121[) respectively 
(see also section 3.15| ). ( 4.117|) is taken to be syntactically ambiguous with two pos- 
sible parses, sketched in (|4~Tl8|) and (]4~T20|) . These give rise to ( [4.119|) and ( [4.121] ) 
respectively. 

(4.118) BA737 had [[entered sector 2] at 5:00pm]. 

(4.119) Part[5:00pm9 , fv v ] A Past[el v , Perf[e2 v , At[fv v , enter [ba737 , sector2)]]] 

(4.120) [BA737 had [entered sector 2]] at 5:00pm. 

(4.121) Part[5:00pm 9 , fv v ] A At[fv v , Past\eV , Perf[e2 v , enter (ba737 , sector2)}}} 



One complication of this approach is that it generates two equivalent formulae for the 
present perfect ( [4,122]) , shown in ( 4.124| ) and ( 4.126|) . ("Has" does not introduce a 
Perf; see ( 4.37 ).) These correspond to the parses of ( |4.123|) and ( 4.125 ) respectively. 



(4.122) BA737 has entered sector 2 at 5:00pm. 

(4.123) [BA737 has [entered sector 2]] at 5:00pm. 

(4.124) Part[5:00pm3 , fv v ] A At[fv v , Past[e v , enter (6a737 ; sector2)}} 
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In the prototype Nlitdb, the sign of "has" is slightly more complex than (4.37). It 
requires the CONT of the verb-phrase complement of "has" to be of sort predicate. 
This blocks flOgg ) and (]4126|), b ecause in ( [4.125D the "at 5:00pm" causes the CONT 
of "entered sector 2 at 5:00pm" to become of sort at_op (it inserts an At operator), 
which is not a subsort of predicate. 



(4.110) corresponds to the interjacent meaning of punctual adverbials, which according 
to table 2.2 on page ^4] is possible only with states and activities. ( [4.110 ) also covers 
cases where punctual adverbials combine with points. There are also other "at" signs, 
that are similar to ( 4. 110| ) but that introduce additional Begin or End operators. These 
correspond to the inchoative (with activities and culminating activities) and terminal 
(with culminating activities) meanings of punctual adverbials. 



4.11.2 Period adverbials 



I now turn to period adverbials (section p. 9. 2 ). (4.127) shows one of the signs of "on" 
that are used when "on" introduces period adverbials. 



(4.127) 



PHON 



(on) 



SYNSEM LOC 



CAT 



prep 



PRD 

MOD S 



VFORM fin 



:[U V VP 



MOD LOC CAT ASPECT 



VFORM psp 

culmact 



m 



SPR 
SUBJ 



(> 
(> 



CONT 
at_op 



COMPS (NP 

ASPECT point 

TIME_SPEC 

MAIN_PSOA 

end 



day^ent] 



MAIN_PSOA [H 



QSTORE {} 



(4.127), which can be used only when the "on ... " adverbial modifies a culminating 
activity, corresponds to the reading where the situation of the culminating activity 
simply reaches its completion within the adverbial's period (table pi on page 49). 



(( 4.127 ) causes the aspectual class of the culminating activity to become point. This 
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agrees with table |2.3| .) For example, Q4.127| ) causes ( [4.128] ) to be mapped to ( 4.129| ). 



(I assume here that "to repair" is classified as culminating activity verb.) Intuitively, 
( }4,129 ) requires a past period e v to exist, such that e v covers a whole repair of engine 



2 by J.Adams (from start to completion), and the end-point of e v falls within some 
Monday. That is, the repair must have been completed on Monday, but it may have 
started before Monday. 

(4.128) J.Adams repaired engine 2 on Monday. 

(4.129) Part[monday 9 ,m v ] A 

At[m v , End[Past[e v , [Culm[repairing(occr v ,j_adams, en<72)]]]| 



There is also an "on" sign that is similar to ( 4.127| ), but that does not introduce an 



End operator, preserves the aspect of the modified expression, and can be used when 
"on ..." adverbials modify expressions from all four aspectual classes. This sign causes 
(|4.128[) to be mapped to ( |4.130| ) (the prototype Nlitdb would generate both ([4.1291 ) 



and ( 4.13C| )). ( 4.130 ) corresponds to the reading where the repair must have both 
started and been completed within a (the same) Monday. The "on" sign that does not 
introduce an End also gives rise to appropriate formulae when "on ... " adverbials 
modify state, activity, or point expressions. 

(4.130) Part[monday 9 ,m v ] A 

At[m v , Past[e v , [Culm[repairing(occr v , jjidams, eng2)]]] 



Both ( 4.127 ) and the "on" sign that does not introduce an End require the noun-phrase 
complement of "on" to introduce an index of sort day^ent. The signs of "1/1/91" and 
"Monday" introduce indices of sorts 1/1/91 and day_ent-var respectively, which are 



subsorts of day-ent (see figure [O] on page |12q ). Hence, "1/1/91" and "Monday" are 
legitimate complements of "on" in period adverbials. In contrast, "5:00pm" introduces 
an index of sort minute-ent_var (see ( [4.111 )), which is not a subsort of day_ent. Hence, 



( 4.131 ) is correctly rejected. 



(4.131) *Tank 2 was empty on 5:00pm. 

The signs of other prepositions that introduce period adverbials (e.g. "in 1991", "before 
29/10/95", "after 5:00pm") and the signs of "yesterday" and "today" are similar to the 
signs of "on", except that "before" and "after" introduce Before and After operators 
instead of Ats. Also, "before" is given only one sign, that does not introduce an 
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End (there is no "before" sign for culminating activities analogous to ( P~I27D , that 
introduces an End). This is related to comments in section 2.9.2| that in the case of 
"before ... " adverbials, requiring the situation of a culminating activity to simply 
reach its completion before some time (reading with End) is equivalent to requiring 
the situation to both start and reach its completion before that time (reading without 
End). 



4.11.3 Duration adverbials 



The treatment of "for ... " duration adverbials is rather ad hoc from a syntax point 
of view. In an adverbial like "for two days", both "two" and "days" are taken to be 
complements of "for", instead of treating "two" as the determiner of "days", and "two 
days" as a noun-phrase complement of "for". 

Number-words like "one", "two", "three", etc. are mapped to signs of the form of 
( 4.132[) . Their restrs are empty, and their indices represent the corresponding num- 
bers. (The 2 of flOgl ) is a subsort of sem_num; see section |4.4|.) 



(4.132) 



PHON (two) 



SYNSEM LOC 



CAT 



CONT 
nom_obj 



HEAD 

SPR 

SUBJ 

COMPS 



det 







SPEC none 



index 2 

RESTR {} 



QSTORE {} 

Although words like "one", "two", "three", etc. are classified as determiners (the head 
of ( |4. 132| ) is of sort det), the none value of their SPEC does not allow them to be used 
as determiners of any noun. (Determiners combining with nouns are the specifiers of 
the nouns. The none means that the word of the sign cannot be the specifier of any 
constituent, and hence cannot be used as the determiner of any noun.) 



( 4.133 ) shows the sign of "for" that is used in duration adverbials (for typesetting 
reasons, I show the feature structures that correspond to HD and \§\ separately, in fl4.134| ) 
and ( 4.135| ) respectively). 
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(4.133) 



PHON (for) 



SYNSEM LOG 



CAT 



prep 



PRD — 
MOD S 



VFORM fin 



:H V vp 



VFORM psp 



MOD | LOC | CAT | ASPECT (lex_state 

V progressive 

V activity]^ 



CONT 
for_op 



SPR () 
SUBJ () 

comps (DO, m} 

ASPECT □ 

DUR-UNIT [U 
DURATION [U 
MAIN_PSOA S 



QSTORE {} 



(4.134) 

(4.135) 



LOC 



CAT | HEAD det 

CONT I index sem.numOD 



m 



LOC 



CAT 



CONT RESTR 



head noun 

SPR {_) 

SUBJ {} 

COMPS {} 



part 



partng compljpartng\2\ 



m 



The comps of (|4~133|) means that "for" requires two complements: a determiner that 
introduces a number-denoting (sem_num) index (like the "two" of ( 4.132[) ), and a noun 
that introduces a Part operator whose first argument is a complete partitioning name 
(like the "day" of (p~79D). In (|4136|) , "for two days" receives (|4~T37|) . (As already 
mentioned, no number-agreement checks are made, and plural nouns are treated se- 
mantically as singular ones. Apart from phon, the sign of "days" is the same as 

Hi)-) 



(4.136) Tank 2 was empty for two days. 
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(4.137) 



PHON {for, two, days) 



SYNSEM LOG 



CAT 



prep 



PRD 
MOD 



VFORM fin 



:H V vp 



VFORM psp 



MOD | LOC | CAT | ASPECT (lex_state 

V progressive 

V activity]^ 



SPR 
SUBJ 
COMPS 
ASPECT 





m 



CONT 
for_op 



DUR-UNIT 
DURATION 
MAIN_PSOA 



day 

2 

m 



QSTORE {} 



When "tank 2 was empty" combines with its temporal-adverbial modifier "for two 
days", the @ of ( 4.137| ) becomes a feature structure that represents the Top formula 

the 



for "tank 2 was empty", i.e. ( 4. 138| ) . According to the semantics principle of ( |4.6£ 
sign of ( [4. 136 ) inherits the CONT of Q4.137| ) (where @] now represents ( 4. 138| ) ) . Hence, 
(|4136| ) is mapped to ( ^139|) . 



(4.138) Past[e v ,empty(tank2)} 

(4.139) For[day c , 2, Past[e v , empty (tank2)}} 



Following table |2,4| on page ( 4.133| ) does not allow "for ... " adverbials to modify 
point expressions (the mod|loc|cat|aspect of (4.133) cannot be point). It also does 
not allow "for ... " adverbials to modify consequent states. If "for ... " adverbials 
were allowed to modify consequent states, ( 4.140 ) would receive ( |4. 141| ) and ( |4.142| ). 



(4.140) BA737 had circled for two hours. 

(4.141) Past[el v ,Perf[e2 v ,For[hour c ,2,circling(ba737)}}} 

(4.142) For[hour c , 2, Past[el v , Perf[e2 v , circling{ba737)]]] 



(4.141) corresponds to the parse of (4.140) where "for two hours" modifies the past 
participle "circled" before "circled" combines with "had". In that case, the "for ... " 
adverbial modifies an activity, because past participles retain the aspectual class of the 
base form ("to circle" is an activity verb in the airport domain). ( 4.142; ) corresponds 
to the parse where "for two hours" modifies the whole sentence "BA737 had circled". 



CHAPTER 4. FROM ENGLISH TO TOP 171 

In that case, the "for ... " adverbial modifies a consequent state, because the "had" 
has caused the aspectual class of "BA737 had circled" to become consequent state. 
By not allowing "for ... " adverbials to modify consequent states, ( 4.142[ ) is blocked. 



This is needed, because in (4.142) two hours is the duration of a period (pointed to by 
el v ) that follows a period (pointed to by e2 v ) where BA737 was circling. This reading 
is never possible when "for ... " adverbials are used in past perfect sentences. The 



"for ... " adverbial of ( [4.140 ) can only specify the duration of the circling (a reading 



captured by ( [4.14l| )). (A similar observation is made on p. 587 of flKamp fc Reyle 93| .) 



The present treatment of "for ..." duration adverbials causes 

OH) 

to receive ( [4.144| ) . 



( 4.144 ) does not capture correctly the meaning of (p43l) , because it requires the taxi- 



ing to have been completed, i.e. BA737 to have reached gate 2. In contrast, as discussed 
in section 2.9,3; , the "for ... " adverbial of ( [4.143 ) cancels the normal implication of 



"BA737 taxied to gate 2." that the taxiing was completed. The post-processing (sec- 
tion 4.17| below) removes the Culm of ( 4.144 ), generating a formula that does not 



require the taxiing to have been completed. 

(4.143) BA737 taxied to gate 2 for five minutes. 

(4.144) For[minute c , 5, Past[e v , Culm[taxiing_to(ba737, gate2)]]] 



Duration adverbials introduced by "in" (e.g. ( 4.145| )) are treated by mapping "in" to 



a sign that is the same as ( |4.133 ), except that it allows the adverbial to modify only 



culminating activities. (The framework of this thesis does not allow "in ..." duration 
adverbials to modify states, activities, or points; see section 2.9.4[ ) 



(4.145) BA737 taxied to gate 2 in five minutes. 

This causes ( |4,145| ) to be mapped to ( [4.144! ), which correctly requires the taxiing to 
have been completed, and the duration of the taxiing (from start to completion) to be 
five minutes. (In this case, the post-processing does not remove the Culm.) 

4.12 Temporal complements of habituals 

Let us now examine more closely the status of temporal prepositional-phrases, like "at 
5:00pm" and "on Monday" in ( ^146|) - ( pl9|) . 



(4.146) BA737 departed at 5:00pm. 
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(4.147) BA737 departs at 5:00pm. 

(4.148) J.Adams inspected gate 2 on Monday. 

(4.149) J.Adams inspects gate 2 on Monday. 



(4.146) has both a habitual and a non-habitual reading. Under the non-habitual read- 
ing, it refers to an actual departure that took place at 5:00pm. Under the habitual 
reading, it means that BA737 had the habit of departing at 5:00pm (this reading is 
easier to accept if an adverbial like "in 1992" is added). In ( [4.147] ), only the habitual 
reading is possible, i.e. BA737 currently has the habit of departing at 5:00pm. (A 



scheduled-to-happen reading is also possible, but as discussed in section |2.5.l| this is 
ignored in this thesis.) Similar comments apply to ( 4.148 ) and ( 4.149| ). 



To account for the habitual and non-habitual readings of "to depart" in ( [4.146 ) and 
( 4.147 ), the base form of "to depart" is given the signs of ( ETToOl ) and $~15l\ ). These cor- 
respond to what chapter || called informally the habitual and non-habitual homonyms 
of "to depart". ( 4.150| ) classifies the habitual homonym as (lexical) state, while ( 4.151 ) 



classifies the non-habitual homonym as point (this agrees with table 2A on page |26[) . 
According to ( 4.150|) , the habitual homonym requires an "at ..." prepositional phrase 
that specifies the habitual departure time (this is discussed further below). In contrast, 



the non-habitual homonym of ( 4.151 ) requires no complement. 



(4.150) 



phon (depart) 



SYNSEM 



LOC 



CAT 



VFORM bse 
AUX 



verb 

ASPECT lex_state 



SPR 
SUBJ 

COMPS 





{ NP 
( PP 



-PRD 



flight-en 
-PRD, PFORM at 



m,inute^gappy\ 



CONT 



hab_departs_at 



ARGl CD 
ARG2 m 



QSTORE {} 
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(4.151) 



phon (depart) 



SYNSEM 



LOC 



CAT 



VFORM bse 
AUX 



CONT 



HEAD 

verb 
ASPECT point 

SPR {) 
SUBJ { NP 

COMPS () 

ARGl □ 

actLdepart - 



f light_enl\ 



QSTORE {} 

In the airport domain, there are actually two habitual signs for "to depart'' ', one where 
"to depart" requires an "at ... " prepositional-phrase complement (as in ( 4.150Q ), and 
one where "to depart" requires a "from ... " prepositional-phrase complement (this is 
needed in ( gl5f )). There are also two non-habitual signs of "to depart", one where "to 



depart" requires no complement (as in (4.151)), and one where "to depart" requires a 
"from ... " prepositional-phrase complement (needed in ( 4.153| )). For simplicity, here 
I ignore these extra signs. 

(4.152) BA737 (habitually) departs from gate 2. 

(4.153) BA737 (actually) departed from gate 2. 



( [4.150 ), (4.151), and the simple-past lexical rules of section 4.7.1 give rise to two signs 
(a habitual and a non-habitual one) for the simple past "departed". These are the 
same as (4.150) and ( |4.151| ), except that they contain additional Past operators. In 



contrast, the simple-present lexical rule of section |4.7.1| generates only one sign for 
the simple present "departs". This is the same as ( 4.150D , except that it contains an 
additional Pres operator. No simple-present sign is generated from ( |4.15l| ), because 
the simple-present lexical rule requires the aspect of the base sign to be state. 



The non-habitual simple-past sign of "departed" ', the "at" sign of (pl0|) , and the 



"5:00pm" sign of ( 4.111 ), cause ( 4.146|) to be mapped to ( 4.154 ), which expresses the 
non-habitual reading of ( [4.1461) . In this case, "at 5:00pm" is treated as a temporal- 
adverbial modifier of "BA737 departed", as discussed in section |4.11.1| . 

(4.154) Part[5:00pm, fv v ] A At[fv v , Past[e v , act_depart(ba737)]] 



In the habitual reading of ( 4.146| ), where the habitual sign of "departed" (derived 
from ( 4.150|) ) is used, "at 5:00pm" is treated as a prepositional-phrase complement of 
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"departed". In this case, the sign of "at" that introduces non-predicative prepositional- 
phrase complements (i.e. (4.59)) is used. The intention is to map ( 4.146| ) to ( 4.155] ), 
where 5:00pm is a constant acting as a "generic representative" of 5:00pm minutes 
(section pU8|) . 

(4.155) Past[e v , hab_departs_at(ba737, 5:00pm)} 



The problem is that in this case the "5:00pm" sign of ( 4.111 ) cannot be used, because 
it inserts a Part operator in qstore. The semantics principle would cause this Part 
operator to be inherited by the sign of the overall ( 4.146| ), and thus the Part operator 
would appear in the resulting formula. In contrast, ( 4,155| ) (the intended formula for 
( 4.146|) ) contains no Part operators. To solve this problem, one has to allow an extra 



sign for "5:00pm", shown in ( 4.156 ), which does not introduce a Part. Similarly, an 



extra "Monday" sign is needed in ( 4.148| ). (The fact that these extra signs have to 
be introduced is admittedly inelegant. This is caused by the fact that "at 5:00pm" 



is treated differently in (4.154) and (4.155); see also the discussion in section 3.18 



The "5:00pm" sign of (|T5|), the "at" sign that is used when "at" introduces non- 
predicative prepositional-phrase complements (i.e. ( [4.59 )), and the habitual "departed" 
sign (derived from ( 4.150 )) cause ( |4.146 ) to be mapped to ( 4.155 ). 



(4.156) 



PHON (5:00pm) 



SYNSEM LOC 



CAT 



CONT 
nom_obj 



HEAD 

noun 

SPR 



PRD 





SUBJ () 
COMPS () 

index 5:00pm 

RESTR {} 



QSTORE {} 



The habitual "departed" sign (which derives from (|4~T50|) ) requires the index of the 
prepositional-phrase complement to be of sort minute_gappy. As wanted, this does 



not allow the "5:00pm" sign of ( 4.111 ) (the one that introduces a Part) to be used in 
the prepositional-phrase complement of the habitual "departed", because if (|4.11l| ) is 
used, the index of the prepositional phrase will be of sort minute-ent-var, which is not 



a subsort of minute-gappy (see figure iA on page |126| ). In contrast, ( 4.156 ) introduces 
an index of sort 5:00pm, which is a subsort of minute-gappy, and hence that sign can 
be used in the complement of the habitual "departed" . 
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The treatment of the simple present ( [4.147| ) is similar. In this case, the habitual simple 
present sign (that is derived from ( f4.15C| )) is used, and ( f4.147j ) is mapped to (4.157). No 



Top formula is generated for the (impossible) non-habitual reading of (4.147), because 



there is no non-habitual sign for the simple present "departs" (see comments above 
about the simple present lexical rule). 

(4.157) Pres[hab-departs-at(ba737 ', 5:00pm)] 

4.13 Fronted temporal modifiers 



As discussed in section 4.11, in this thesis temporal-adverbial modifiers (e.g. "at 



5:00pm" in (fL15S| ) - (pl9|) , "on Monday" in ( gl6CD - flOjjjD ) can modify either 



whole finite sentences or past participle verb phrases. 

(4.158) BA737 entered sector 2 at 5:00pm. 

(4.159) At 5:00pm BA737 entered sector 2. 

(4.160) Tank 2 was empty on Monday. 

(4.161) On Monday tank 2 was empty. 

In HPSG, the order in which a modifier and the constituent to which the modifier 
attaches can appear in a sentence is controlled by the "constituent-ordering princi- 
ple" (Cop). This is a general (and not fully developed) principle that controls the 
order in which the various constituents can appear in a sentence (see chapter 7 of 
[ Pollard fe Sag 87[ ). This thesis uses an over-simplified version of Cop, that places no 



restriction on the order between temporal modifiers and modified constituents when 
the modified constituents are sentences. This allows "at 5:00pm" to either follow 
"BA737 entered sector 2" (as in ( [4. 158] )), or to precede it (as in ( [4.159] )), Similarly, 
"on Monday" may either follow "tank 2 was empty" (as in ( 4.160D ), or precede it (as 



in ( |4.161D ),P| When temporal modifiers attach to past-participle verb phrases, how- 
ever, I require the modifiers to follow the verb phrases, as in fl£l63) . This rules out 
unacceptable sentences like ( |4.163| ), where "at 5:00pm" precedes the "entered sector 
2".Q 



An alternative approach is to allo w temporal modifi ers to participate in unbounded dependency 



constructions; see pp. 176 - 181 of [Pollard & Sag 94]. 
10 Constituent-ordering restrictions are enforced in the Ale grammar of the prototype Nlitdb in 
a rather ad hoc manner, which involves partitioning the synsem sort into pre_mod_synsem and 
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This approach causes ( [1. 164 ) to receive only ( |4.165| ), because in ( [4.164 ) "at 5:00pm" can 



modify only the whole "BA737 had entered sector 2" (it cannot modify just "entered 
sector 2" because of the intervening "BA737 had"). In ( |4.165| ), 5:00pm is a reference 
time, a time where the entrance had already occurred. In contrast, ( |4,166] ) receives 
both (|4.165|) and ([4.1671 ), because in that case "at 5:00pm" can modify either the whole 



"BA737 had entered sector 2" or only "entered sector 2". In (|4.167| ), 5:00pm is the 
time where the entrance occurred. 

(4.164) At 5:00pm [BA737 had entered sector 2}. 

(4.165) Part[5:00pm, fv v ] A At[fv v , Past[el v , Perf [e2 v , enter (ba737, sector2)\\\ 

(4.166) BA737 had entered sector 2 at 5:00pm. 

(4.167) Part[5:00pm, fv v ] A Past[el v , Perf[e2 v , At[fv v , enter(ba737, sector2)]]] 



The fact that fl4Tl64j) does not receive (|4.167[ ) does not seem to be a disadvantage 



because in ( 4.164| ) the reading where "at 5:00pm" specifies the time of the entrance 



seems unlikely (or at least much more unlikely than in ( 4.166| )). 

4.14 Temporal subordinate clauses 



I now discuss temporal subordinate clauses (section [2.10 ), focusing on "while 



clauses. The treatment of "before ... " and "after ... " clauses is very similar. 

As with period adverbials, "while ... " clauses are treated as temporal modifiers of 
finite sentences or past participle verb phrases. As with prepositions introducing period 



adverbials, "while" is given two signs. The first one, shown in (4.168), introduces an 
End operator, causes an aspectual shift to point, and can be used only with culminating 
activity main clauses (( 4. 168| ) is similar to ( 4. 127| ) .) The second one is the same as 



(4.168), except that it does not introduce an End, it preserves the aspectual class 
of the main clause, and it can be used with main clauses of any aspectual class. In 
both cases, "while" requires as its complement a finite sentence whose aspect must not 



post_rnod_synsem, and using feature structures from the two subsorts as values of MOD to signal that 
the modifier can only precede or follow the modified constituent. This idea was borrowed from a 
grammar written by Suresh Manandhar. 
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be consequent state (this agrees with table 2.6 on page |5^, which does not allow the 
aspectual class of the "whiie"-clause to be consequent state). 



(4.168) 



PHON 



(while) 



SYNSEM LOC 



CAT 



MOD 



VFORM fin 



:U V vp 



MOD LOC CAT ASPECT 



VFORM psp 
culmact 



CONT 
at_op 



SPR 













SUBJ 

















VFORM 


fin 




COMPS 


<s 












ASPECT 


-^cnsqstate 


ASPECT 


point 








TIME.SPEC 


m 






MAIN_PSOA 




MAIN 


-psoa m 








end 











QSTORE {} 



The [D in ( |4.168| ) denotes the CONT of the sign of the complement of "while" (the 
subordinate clause). The two "while" signs cause ( 4. 169|) to receive ( p7C|) and ( p7l| ). 
("To land" is a culminating activity verb in the airport domain). ( [4.170 ) requires the 



landing to have simply been completed during the inspection, while ( 4.171 ) requires 
the landing to have both started and been completed during the inspection. 

(4.169) UK160 landed while J.Adams was inspecting BA737. 

(4.170) At[Past[el v , inspecting (j _adams , 6a737)], 

End[Past[e2 v , Culm[landing{occr v , itfcl60)]]]] 

(4.171) At[Past[eV , inspecting (j_adams, 6a737)], 

Past[e2 v , Culm[landing(occr v , uklGO)]]] 

Since "while ..." clauses are treated as temporal modifiers, the ordering arrangements 
of section |4.13| apply to "while ... " clauses as well. Hence, "while ... " clauses can 
either precede or follow finite sentences (e.g. ( 4.172 ), ( 4.173|) ). 



(4.172) UK160 arrived while J.Adams was inspecting BA737. 

(4.173) While J.Adams was inspecting BA737, UK160 arrived. 



One problem with the present treatment of "while ..." clauses is that it maps ( |4,174| ) 
to (|4.175|) , which requires the inspection to have been completed. This does not agree 
with table |2.6| on page 5£ , according to which any requirement that the situation of a 
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culminating activity sentence must have been reached is cancelled when the sentence is 
used as a "while ... " clause. To overcome this problem, the post-processing (section 
4l7| below) removes any Culm operators that are within first arguments of At opera- 
tors. This removes the Culm of (4.175), generating a formula that no longer requires 
the inspection to have been completed. 

(4.174) UK160 departed while J.Adams inspected BA737. 

(4.175) At[Past[el v , Culm[inspecting(jjadams,ba7S7)]], 

Past[e2 v , [actl_depart{ukim)}}\ 



4.15 Interrogatives 



So far, this chapter has considered mainly assertions (e.g. ( 4.176|) ). (The reader is 
reminded that assertions are treated as yes/no questions; e.g. ( [4.176; ) is treated as 
( 4.1770 .) I now explain how the Hpsg version of this thesis copes with questions (e.g. 



d4l77D - flO|D ) 



(4.176) Tank 2 was empty. 

(4.177) Was tank 2 empty? 

(4.178) Did J.Adams inspect BA737? 

(4.179) Which tank was empty? 

(4.180) Who inspected BA737? 

(4.181) What did J.Adams inspect? 

(4.182) When did J.Adams inspect BA737? 



Yes/no questions (e.g. ( [4.177 ), ( 4.178| )) constitute no particular problem. Hpsg's 
schemata allow auxiliary verbs to be used in sentence-initial positions, and cause ( f4.177| ) 
to receive the same formula (shown in (4.184)) as ( 4.183j ). In both cases, the same lex- 
ical signs are used. Similar comments apply to ( 4.17$ ) and ( 4.185f ), which are mapped 
to Q4. 186|) . 

(4.183) Tank 2 was empty. 

(4.184) Past[e v , empty (tank2)} 

(4.185) J.Adams did inspect BA737. 
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The interrogative "which" is treated syntactically as a determiner of (non-predicative) 
noun phrases. The sign of "which" is the same as the sign of "the" of (|4.65| ), except that 
it introduces an interrogative quantifier rather than an existential one. For example, 
( 4.179 ) is analysed syntactically in the same way as ( |4.187j ) (punctuation is ignored). 
However, the formula of ( 4.179j ) (shown in ( 4.18S| )) contains an additional interrogative 
quantifier (cf. the formula of ( [4.1871 ), shown in (|4.188| )). (I assume here that "tank" 
does not introduce an Ntense. The "a" of ( [4.187] ) introduces an existential quantifier 
which is removed during the extraction of (|4.188| ) from the sign of ( [4.187 ), as discussed 
in section 



(4.187) A tank was empty. 

(4.188) tank(tk v ) A Past[e v , empty(tk v )) 

(4.189) ?tk v tank(tk v ) A Past[e v , empty{tk v )} 



The interrogative "who" is treated syntactically as a non-predicative noun-phrase. Its 
sign, shown in ( 4.190| ), introduces an interrogative quantifier. 



(4.190) 



PHON 



(who) 



SYNSEM LOG 



CAT 



CONT 
nom_obj 



HEAD 

noun 
SPR 

SUBJ {) 
COMPS {) 



INDEX 

person_ent 
RESTR {} 



TVAR + 



m 



QSTORE 



quant L 



det interrog 
RESTIND 



(4.180) is analysed syntactically in the same way as ( |4.191| ). The sign of "who", 
however, gives rise to an interrogative quantifier in the formula of ( 4.180| ) (shown 
in ( |4.193[ )), which is not present in the formula of ( 4.191| ) (shown in ( |4. 192| ) ) . The 
interrogative "what" is treated similarly. 



(4.191) J.Adams inspected BA737. 

(4.192) Past[Culm[inspecting(occr v , j_adams, 6a737)]] 
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(4.193) ?wh v Past[Culm[inspecting(occr v ,wh v , 6a737)]] 



The Hpsg version of this thesis admits questions like ( [4.194 ), which are unacceptable 
in most contexts. ( |4.194| ) is licensed by the same syntactic analysis that allows ( 4.195| ), 
and receives the same formula as ( [4.196 ). 



(4.194) ?Did J.Adams inspect which flight? 

(4.195) Did J.Adams inspect a flight? 

(4.196) Which flight did J.Adams inspect? 



Questions like ( [4.196 ), where the interrogative refers to the object of the verb, are 
treated using Hpsg's unbounded-dependencies mechanisms (more precisely, using the 
SLASH feature; see chapter 4 of [[Pollard Sag 94| ] ) .P^] Roughly speaking, ( [4.196| ) is 



analysed as being a form of (4.194), where the object "which flight" has moved to 
the beginning of the question. Hpsg's unbounded-dependencies mechanisms will not 
be discussed here (see [ pollard fc Sag 94 1; the prototype Nlitdb uses the traceless 
analysis of unbounded dependencies, presented in chapter 9 of [ Pollard Sz Sag 94| ). 



The present treatment of interrogatives allows questions with multiple interrogatives, 
like (4.197) which receives (4.198). (( [4.197 ) is parsed in the same way as ( [4.199j ).) 
Unfortunately, it also allows ungrammatical questions like ( 4.200[ ), which is treated as 
a version of ( |4.197| ) where the "what" complement has moved to the beginning of the 
sentence. ( Q4.200D receives ( [4.197| ).) 



(4.197) Who inspected what. 

(4.198) 1wl v ?w2 v Past[e v , Culm[inspecting(occr v ,wl v ,w2 v )}} 

(4.199) J.Adams inspected BA737. 

(4.200) *What who inspected. 



The interrogative "when" of ( 4.201 ) is treated as a temporal-adverbial modifier of finite 
sentences. ( 4.203j ) shows the sign of "when" that is used in ( 4.201 ). ( 4.203|) causes 
( 4.201 ) to receive (|4202|) . 



11 Pollard and Sag also reserve a QUE feature, which is supposed to be used in the treatment of 
interrogatives . They provide virtually no information on the role of QUE, however, pointing to 



Ginzburg 92 ] where QUE is used in a general theory of interrogatives. Ginzburg's theory is intended 
to address issues well beyond the scope of this thesis (e .g. the relatio n between a que stion and the 
facts that can be said to resolve that question; see also [Ginzburg 95a], [Ginzburg 95b|). QUE is not 
used in this thesis. 
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(4.201) When was tank 2 empty? 



(4.202) 
(4.203) 



mx iw v Past[e v , empty (tank2)] 
phon (when) 



SYNSEM LOC 



HEAD 



MOD 



VFORM fin 



m 



MOD LOC CAT ASPECT 



m 



QSTORE 



CAT SPR () 

SUBJ () 

COMPS (} 

ASPECT [U 
CONT S 

det interrogjmxl 

INDEX 



RESTIND 



temp_ent 
RESTR {} 



TVAR + 



( 4.203 ) introduces interrogative-maximal quantifiers whose variables (w v in ( |4.202| )) 
do not appear elsewhere in the formula. The post-processing (to be discussed in sec- 



tion 4.17 ) replaces the variables of interrogative- maximal quantifiers by variables that 



appear as first arguments of Past or Perf operators. In (4.202), this would replace w v 
by e v , generating a formula that asks for the maximal past periods where tank 2 was 
empty. 



There is also a second sign for the interrogative "when" (shown in ( 4.206| )), that is 
used in habitual questions like ( 4.204| ). In ( [4.204| ) , "when" is taken to play the same 
role as "at 5:00pm" in fl4.205| ), i.e. it is treated as the prepositional-phrase complement 
of the habitual "depart" (see section [4.12 ), which has moved to the beginning of the 
sentence via the unbounded-dependencies mechanisms. 



(4.204) When does BA737 depart (habitually)? 

(4.205) Does BA737 depart (habitually) at 5:00pm? 
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(4.206) 



PHON 



{whe 



SYNSEM LOG 



CAT 



CONT 
nom_obj 



HEAD 



PRD 



prep 
SPR () 
SUBJ () 
COMPS () 

INDEX 

gappyjpartng 
RESTR {} 



TVAR + 



QSTORE 



det interrog 

RESTIND 



In the simple past ( 4,2071 ), both the (state) habitual homonym of "to depart" (that 
of ( fTl5C| ), which requires a prepositional phrase complement) and the (point) non- 



habitual homonym (that of ( 4.151 ), which requires no complement) can be used. Hence, 
"when" can be either a prepositional-phrase complement of the habitual "depart" (us- 
ing (|4.206|) ), or a temporal modifier of the non-habitual sentence "did BA737 depart" 
(using fl4.203|) ). This gives rise to ( |4.208| ) and ( |4.209| ), which correspond to the habit- 
ual and non-habitual readings of ( [4.207| ) (the w v of ([4.209 ) would be replaced by e v 
during the post-processing). 

(4.207) When did BA737 depart? 

(4.208) ?w v Past[e v , hab.departs.at(ba737, w v )] 



(4.209) 



^mxiw v Past[e v , actl-depart(ba737)] 



4.16 Multiple temporal modifiers 



The framework of this thesis currently runs into several problems in sentences with 
multiple temporal modifiers. This section discusses these problems. 



Both preceding and trailing temporal modifiers: Temporal modifiers are al- 
lowed to either precede or follow finite sentences (section [4. 13 ). When a finite sentence 
is modified by both a preceding and a trailing temporal modifier (as in ( 4.210| )), two 
parses are generated: one where the trailing modifier attaches first to the sentence (as 
m ( jgllD ), and one where the preceding modifier attaches first (as in ( |4.213[ )). In most 
cases, this generates two semantically equivalent formulae ((4.212) and (4.214) in the 
case of ( [4.210] )) . A mechanism is needed to eliminate one of the two formulae. 



CHAPTER 4. FROM ENGLISH TO TOP 183 

(4.210) Yesterday BA737 was at gate 2 for two hours. 

(4.211) Yesterday [[BA737 was at gate 2] for two hours.] 

(4.212) At[y ester day, For[hour c , 2, Past[e v , located_at(ba737, gate2)]]] 

(4.213) [Yesterday [BA737 was at gate 2]] for two hours. 

(4.214) For[hour c ,2,At[yesterday,Past[e v ,located.at(ba737,gate2)}]} 



Multiple temporal modifiers and anaphora: Another problem is that a question 
like ([42151 ) is mapped to ( gl21tj ). (I assume here that "flight" does not introduce an 



Ntense.) The problem with ( |4.216| ) is that it does not require fv v to be the particular 
5 :00pm-minute of 2/11/95. ( [QT| ) requires the flight to have arrived on 2/11/95 and 



after an arbitrary 5:00pm-minute (e.g. the 5:00pm-minute of 1/11/95). In effect, this 
causes the "after 5:00pm" to be ignored. 

(4.215) Which flight arrived after 5:00pm on 2/11/95? 

(4.216) ?/r flight(fF) A Part[5:00pm9 , fv v }A 
At[2/ll/95,After[fv v , Past[e v , arrive(fl v )]}] 



This problem seems related to the need for temporal anaphora resolution mecha- 
nisms (section |2.12j ). In ( 14.2171 ), for example, the user most probably has a particular 
(contextually-salient) 5:00pm-minute in mind, and an anaphora resolution mechanism 
is needed to determine that minute. A similar mechanism could be responsible for 



reasoning that in ( 4.215 ) the most obvious contextually salient 5 :00pm- minute is that 
of 2/11/95. 

(4.217) Which tanks were empty before/at /after 5:00pm? 



Culminating activity with both punctual and period adverbial: A further 
problem appears when a culminating activity is modified by both a punctual and a 
period adverbial.p| The problem is that, unlike what one would expect, (4.218) and 



(|42l9D do not receive equivalent Top formulae. (I assume here that "to repair" is 



classified as culminating activity verb.) 

(4.218) J.Adams repaired fault 2 at 5:00pm on 2/11/95. 

12 The problems of this section that involve period adverbials also arise when temporal subordinate 
clauses are used instead of period adverbials. 
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(4.219) J.Adams repaired fault 2 on 2/11/95 at 5:00pm. 



In ( [4.218 ), the punctual adverbial "at 5:00pm" modifies the culminating activity sen- 
tence "J.Adams repaired fault 2". The punctual adverbial causes "J.Adams repaired 
fault 2 at 5:00pm" to become a point (see table 2/2 on page fl4|). Two formulae are gen- 
erated: one that requires the repair to have started at 5:00pm, and one that requires 
the repair to have been completed at 5:00pm. "On 2/11/95" then modifies the point 
expression "J.Adams repaired fault 2 at 5:00pm". This leads to (|4.220Q and (|4.221|) . In 
( 4.220 ) the repair starts at the 5:00pm-minute of 2/11/95, while in (|4.221 ) the repair 
is completed at the 5:00pm-minute of 2/11/95. (The first reading is easier to accept in 
"J.Adams inspected BA737 at 5:00pm on 2/11/95".) 



(4.220) Part[5:00pm\ fv v ] A At[2/ll/95, 

At[fv v , Begin[Past[e v , Culm[repairing(occr v , j _adams , /au/t2)]]]j] 

(4.221) Part[5:00pm9, fv v ] A At[2/ll/95, 

At[fv v , End[Past[e v , Culm[repairing(occr v , j _adams, fault2)]]]\] 

(A digression: this example also demonstrates why punctual adverbials are taken to 



trigger an aspectual shift to point; see section |2.9.1| . Without this shift, the aspectual 
class of "J.Adams repaired fault 2 at 5:00pm" would be culminating activity, and the 
"on" signs of section [4.11.2| would lead to the additional formulae of ( [4,222] ) and ( |4.223| ) . 
These are equivalent to ([4.220 ) and ( 4.221 ) respectively.) 



(4.222) Part[5:00pm9 , fv v ] A At[2/ll/95, 

End[At[fv v , Begin[Past[e v , Culm[repairing(occr v , j _adams, fault2)]}]\\\ 

(4.223) Part[5:00pm9, fv v ] A At[2/ll/95, 

End[At[fv v , End[Past[e v , Culm[repairing(occr v , j_adams, fault2)]]] j j 



In ( 4.219| ), "J.Adams repaired fault 2" is first modified by the period adverbial "on 
2/11/95". Two formulae (shown in ( [1.224! ) and (|4.225|) ) are generated. ( [4.224! ) requires 



the repair to simply reach its completion on 2/11/95, while ( 4.225 ) requires the repair 
to both start and reach its completion on 2/11/95. In the first case (where ( gTggg ) 
is generated), the aspectual class of "J.Adams repaired fault 2 on 2/11/95" becomes 
point, while in the other case the aspectual class remains culminating activity (see also 



table 2.3 on page n{ 



(4.224) At[2/ 11/95 , End[Past[e v , Culm[repairing(occr v ,j_adams, fault2)]}]\ 
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(4.225) At[2/ll/95, Past[e v , Culm[repairing(occr v , j_adams, fault2)]}] 



In the case of (4.224), where the aspectual class of "J. Adams repaired fault 2 on 
2/11/95" is point, the signs of section [4.11.1 lead to ( |4.226| ), while in the case of 
(|4.225|) , they lead to (|4.227|) and (|4.228|) . 



(4.226) Part[5:00pm9, fv 1 '} A At[fv v , 

At[2/ll/95 , End [Past [e v , Culm, [repair ing(occr v , j_adams, fault2)]]] \ 

(4.227) Part[5:00pm9] AAt[fv v , 

Begin [At[2/ 11/95 , Past [e v , Culm [repairing(occr v , j_adams, fault2)]]] j 

(4.228) Part[5:00pm9] AAt[fv v , 

End [At[2/ll/ 95 , Past [e v , Culm [repairing(occr v , j_adams, fault2)]]] j 



Hence, (4.218) receives two formulae (( 4.220| ) and ( [4.221 )), while ( [4.21S| ) receives three 
(( [4226D - ( gjgg )). ( |4226D is equivalent to ( |3~2"2"lD . They both require the repair to 
reach its completion within the 5:00pm-minute of 2/11/95. Unlike what one might 
expect, however, ( [4.227| ) is not equivalent to ( 4.220|) , ( [4.2271 ) requires a past period 
that covers exactly the whole repair (from start to completion) to fall within 2/11/95, 
and the beginning of that period to fall within some 5:00pm-minute. This means that 
the repair must start at the 5:00pm-minute of 2/11/95 (as in ( 4.220| )), but it also means 
that the repair must reach its completion within 2/11/95 (this is not a requirement in 
Q4.220D ). Also, unlike what one might expect, ( |4.228| ) is not equivalent to ( [4.221| ) and 
flOgg ). (P~228|) requires the repair to reach its completion within the 5:00pm-minute 
of 2/11/95 (as in fljjUjj ) and fljjgg )), but it also requires the repair to start within 
2/11/95 (which is not a requirement in (|4.221 ) and ( 4.226| )). 



The differences in the number and semantics of the generated formulae in ( [4.218| ) and 



( 4.219 ) lead to differences in the behaviour of the Nlitdb that are difficult to explain 
to the user. A tentative solution is to adopt some mechanism that would reorder the 
temporal modifiers, so that the punctual adverbial attaches before the period one. This 



would reverse the order of "on 2/11/95" and "at 5:00pm" in ( 4.219 ), and would cause 
(4.219) to be treated in the same way as Q4.218| ) (i.e. to be mapped to ( [4,220| ) and 
( 4.221 ); these seem to capture the most natural readings of ( |4.21g| ) and (|4.219|) ). 



Culminating activity and multiple period adverbials: A further problem is 
that a sentence like Q4.229j ), where a culminating activity is modified by two period 
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adverbials, receives three formulae, shown in ( |4.230| ) - (|4232|). It turns out that ( p~23ll ) 
is equivalent to (4.232), and hence one of the two should be eliminated. 



(4.229) J.Adams repaired fault 2 in June in 1992. 

(4.230) Part[june3 , f] A A*[1992, 

At[j v , End[Past[e v , Culm[repairing(occr v , j_adams, fault2)]]]\\ 

(4.231) Part[june3,j v ] A A£[1992, 

End[At[j v , Past[e v , Culm[repairing(occr v , j_adams, fault2)]]]\\ 

(4.232) Part[june3 , f] A ,4* [1992, 

At[j v , Past[e v , Culm[repairing(occr v ,j_adams, fault2)]]]] 



A period adverbial combining with a culminating activity can either insert an End 
operator and cause an aspectual shift to point, or insert no End and leave the aspectual 



class unchanged (see section 4.11.2 ). In the case where ( 4.230 ) is generated, "in June" 
inserts an End and changes the aspectual class to point. This does not allow "in 1992" 
(which attaches after "in June") to insert an End, because period adverbials combining 



with points are not allowed to insert Ends (the "on" sign of (4.127) cannot be used 
with points). In the cases where fl423lD or ( ^23p are generated, "in June" does not 
insert an End, and the aspectual class remains culminating activity. "In 1992" can 
then insert an End (as in (4.231)) or not (as in (4.232)). (4.232) requires the whole 
repair to be located within a June and 1992 (i.e. within the June of 1992). ( [4.230| ) is 
weaker: it requires only the completion point of the repair to be located within the 



June of 1992. Finally, (4.231) requires the whole of the repair to be located within a 
June, and the completion point of the repair to fall within 1992. This is equivalent 
to requiring the whole of the repair to fall within the June of 1992, i.e. (4.231) is 
equivalent to (4.232), and one of the two should be eliminated. 



4.17 Post-processing 



The parsing maps each English question to an Hpsg sign (or multiple signs, if the parser 
understands the question to be ambiguous). From that sign, a Top formula is extracted 



as discussed in section 4.6. The extracted formula then undergoes an additional post- 
processing phase. This is a collection of minor transformations, discussed below, that 
cannot be carried out easily during the parsing. 
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Removing Culms: ( |4.234| ) shows the Top formula that is extracted from the sign 
of ( 4.233| ). As discussed in section 4.11.3 , ( 4.234 ) does not represent correctly ( 4.233j ), 
because ( |423^) requires the taxiing to have been completed. In contrast, as discussed 



in section 2.9.2, the "for ... " adverbial of (4.233) cancels the normal implication of 
"BA737 taxied to gate 2" that the taxiing must have been completed. To express 
correctly ( 4.233| ), the Culm of ( 4.234j ) has to be removed. 



(4.233) BA737 taxied to gate 2 for five minutes. 

(4.234) For[minute c , 5, Past [e v , Culm[taxiing_to(ba737, gate2)]]] 

A first solution would be to remove during the post-processing any Culm operator that 
is within the scope of a For operator. The problem with this approach is that duration 
"in ..." adverbials also introduce For operators (see section [4.11.3 ), but unlike "for . . . 
" adverbials they do not cancel the implication that the completion must have been 



reached. For example, the formula extracted from the sign of (4.235) is (4.234). In 
this case, ( f4.234| ) is a correct rendering of ( 4.235| ) (because ( [4.235 ) does imply that 
BA737 reached gate 2), and hence the Culm operator should not be removed. To 
overcome this problem, the prototype Nlitdb attaches to each For operator a flag 
showing whether it was introduced by a "for ... " or an "in ... " adverbial. Only For 
operators introduced by "for ... " adverbials cause Culm operators within their scope 
to be removed. 

(4.235) BA737 taxied to gate 2 in five minutes. 



The post-processing also removes any Culm operator from within the first argument 



of an At operator. As explained in section 4T4 , this is needed to express correctly 
"while ... " clauses. 



? mx i quantifiers: As noted in section 4.15| , before the post-processing the variables 
of interrogative- maximal quantifiers introduced by "when" do not occur elsewhere in 
their formulae. For example, ( 4.237 ) and ( 4,238] ) are extracted from the signs of ( g23| ) 



and (4.238|). In both formulae, w v occurs only immediately after the ? 



mil ■ 



(4.236) When was J.Adams a manager? 

(4.237) r ?mxiw v Past[e v , manager (j_adams)} 

(4.238) When while BA737 was circling was runway 2 open? 
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(4.239) ? mx iw v At [Past[er, circling(ba737)], Past[e2 v , open(runway2)}} 



During the post-processing, the variables of interrogative-maximal quantifiers are re- 
placed by variables that appear as first arguments of Past or Perf operators, excluding 
Past and Perf operators that are within first arguments of At, Before, or After oper- 
ators. In (|4.237|) , this causes w v to be replaced by e v . The resulting formula asks for 



the maximal past periods where J.Adams was a manager. Similarly, the w v of flOgD 



is replaced by e2 v . The resulting formula asks for the maximal past periods e2 v , such 
that runway 2 was open at e2 v , and e2 v is a subperiod of a period eV where BA737 was 
circling. In ( 4.239|) , w v cannot be replaced by eV , because Past[el v , circling (6a737)] 



is within the first argument of an At. 

Past and Perf operators located within first arguments of At, Before, or After op- 
erators are excluded, to avoid interpreting "when" as referring to the time where the 
situation of a subordinate clause held (formulae that express subordinate clauses end- 
up within first arguments of At, Before, or After operators). The interrogative "when" 



always refers to the situation of the main clause. For example, ( 4.238 ) cannot be asking 
for maximal periods where BA737 was circling that subsume periods where runway 2 
was open (this would be the meaning of ( [4.239| ) if w v were replaced by eV). 

When the main clause is in the past perfect, this arrangement allows the variable of 
^mxi t° be replaced by either the first argument of the main-clause's Past operator, 



or the first argument of the main-clause's Perf operator. (4.241), for example, shows 



the formula extracted from the sign of ( 4.240Q . The post-processing generates two 



formulae: one where w v is replaced by eV , and one where w v is replaced by e2 v . The 



first one asks for what section 2.9.1 called the "consequent period" of the inspection 
(the period from the end of the inspection to the end of time). The second one asks 
for the time of the actual inspection. 

(4.240) When had J.Adams inspected BA737? 
(4.241) 

^mxi wV Past\eV , Perf[e2 v , Culm[inspecting(occr v , j _adams , 6a 737)]]] 



Ntense operators: As noted in section 4.9.1, when extracting Top formulae from 



signs, if an Ntense operator is encountered and the sign contains no definite indication 
that the first argument of the Ntense should be now*, in the extracted formula the first 
argument of the Ntense becomes a variable. That variable does not occur elsewhere 
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in the extracted formula. Assuming, for example, that the (non-predicative) "queen" 
introduces an Ntense, the formula extracted from the sign of ( [4.242 ) is ( 4.243| ) . The 
t v of the Ntense does not occur elsewhere in ( 4.243| ). 



(4.242) The queen was in Rome. 

(4.243) Ntense[t v ,queen(q v )] A Past[e\ v , located jat(q v ,rome)} 

During the post-processing, variables appearing as first arguments of Ntenses give rise 
to multiple formulae, where the first arguments of the Ntenses are replaced by now* 
or by first arguments of Past or Perf operators. In ( 4.243] ), for example, the post- 



processing generates two formulae: one where t v is replaced by now* (queen at the 
speech time), and one where t v is replaced by e v (queen when in Rome). 



In ( 4.245| ) (the formula extracted from the sign of ( [4.244[ )), there is no Past or Perf 



operator, and hence t v can only become now*. This captures the fact that the "queen" 
in ( [4.244 ) most probably refers to the queen of the speech time. 



(4.244) The queen is in Rome. 

(4.245) Ntense[t v , queen(q v )] A Pres[located.at(q v , gate2)] 



In ( 4.247| ) (the formula extracted from the sign of ( [4.246] )), the post-processing leads 



to three formulae, where t v is replaced by now* (queen at speech time), e2 v (queen 
during the visit), or el v (queen at a "reference time" after the visit). 

(4.246) The queen had visited Rome. 

(4.247) Ntense[t v ,queen(q v )] A Past[er,Perf[e2 v ,visiting(q v ,rome)}] 



4.18 Summary 

This chapter has shown how Hpsg can be used to translate English questions directed 
to a Nlitdb to appropriate Top formulae. During the parsing, each question receives 
one or more HPSG signs, from which TOP formulae are extracted. The extracted 
formulae then undergo an additional post-processing phase, which leads to formulae 
that capture the semantics of the original English questions. 

Several modifications were made to Hpsg. The main modifications were: (i) Hpsg 
features and sorts that are intended to account for phenomena not examined in this 



CHAPTER 4. FROM ENGLISH TO TOP 190 

thesis (e.g. pronouns, relative clauses, number agreement) were dropped, (ii) The 
quantifier storage mechanism of Hpsg was replaced by a more primitive one, that 
does not allow quantifiers to be unstored during the parsing; the semantics principle 
was modified accordingly, (iii) An aspect feature was added, along with a principle 
that controls its propagation, (iv) The possible values of CONT and QSTORE were 
modified, to represent Top expressions rather than situation-theory constructs, (v) 
A hierarchy of world-entity types was mounted under the ind sort; this is used to 
disambiguate sentences, and to block semantically ill-formed ones, (vi) New lexical 
signs and lexical rules were introduced to cope with temporal linguistic mechanisms 
(verb tenses, temporal adverbials, temporal subordinate clauses, etc.). Apart from 



these modifications, the Hpsg version of this thesis follows closely | Pollard fe Sag 94 1. 



Chapter 5 



From TOP to TSQL2 



11 Time is money. 



5.1 Introduction 



This chapter describes the translation from Top to Tsql2. The discussion starts with 
an introduction to Tsql2 and the version of the relational model on which Tsql2 is 
based. This thesis adopts some modifications to Tsql2. These are described next, 
along with some minor alterations in the Top definition of chapter ^. The translation 
from Top to Tsql2 requires Top's model to be linked to the database; this is explained 
next. The translation is carried out by a set of rules. I explore formally the properties 
that these rules must possess for the translation to be correct, and I describe the 
intuitions behind the design of the rules. An illustration of how some of the rules work 
is also given. The full set of the translation rules, along with a proof that they possess 
the necessary properties, is given in appendix [A]. The chapter ends with a discussion 
of related work and reflections on how the generated Tsql2 code could be optimised. 



5.2 An introduction to TSQL2 



This section introduces Tsql2 and the version of the relational model on which Tsql2 
is based. Some definitions that are not part of the Tsql2 documentation are also 
given; these will be used in following sections. I note that although [Bnodgrass 95] 



defines Tsql2's syntax rigorously, the semantics of the language is defined very in- 
formally, with parts of the semantics left to the intuition of the reader. There are 
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also some inconsistencies in the Tsql2 definition (several of these were pointed out in 
[Androutsopoulos et al. 95a[ ). 



5.2.1 The traditional relational model 



As explained in section [L2.4 , the traditional relational model stores information in 
relations, which can be thought of as tables. For example, salaries below is a relation 
showing the current salaries of a company's employees, salaries has two attributes 
(intuitively, columns), employee and salary. The tuples of the relation are intuitively 
the rows of the table {salaries has three tuples). 



salaries 


employee 


salary 


J.Adams 
T.Smith 
G. Papas 


17000 
19000 
14500 



I adopt a set-theoretic definition of relations (see section 2.3 of | Ullman 8q 1 for alter- 
native approaches). A set of attributes T>a is assumed (e.g. employee and salary are 
elements of T>a). A relation schema is an ordered tuple of one or more attributes 
(e.g. {employee, salary)). A set of domains Vrj = {D\, D2, ■ ■ ■ , D no ] is also assumed. 
Each element D{ of T>£> is itself a set. For example, D\ may contain all strings, D2 all 
positive integers, etc. Each attribute (element of T>a) is assigned a domain (element 
of Dd)- D{A) denotes the domain of attribute A. D on its own refers to the universal 
domain, the union of all Di 6 T>b. 

A relation over a relation schema R = {A\, A2, ■ ■ ■ , A n ) is a subset of D{A\) x D{A2) x 
• • • x D{A n ), where x denotes the cartesian product, and D(A\), D{A2), ■ ■ ■ , D{A n ) 
are the domains of the attributes Ai, A2, ■ ■ ■ , A n respectively. That is, a relation over 
R is a set of tuples of the form ( - ui,i>2, • • • , v n ), where v\ G D{A\), V2 £ D{A2), ■ ■ ■ , 
v n S D{A n ). In each tuple {v\,V2, ■ ■ ■ ,v n ) , v\ is the attribute value of Ai, V2 is the 
attribute value of A2, etc. The universal domain D is the set of all possible attribute 
values. Assuming, for example, that employee, salary £ T>a, that D\ and D2 are as 
in the previous paragraph, and that employee and salary are assigned D\ and D2, 
r below is a relation over {employee, salary) . (r is a mathematical representation of 
salaries above.) On its own, "relation" will be used to refer to a relation over any 
relation schema. 



r = {( J.Adams, 17000), {T.Smith, 19000), {G. Papas, 14500)} 
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The arity of a relation over R is the number of attributes in R (e.g. the arity of r is 2). 
The cardinality of a relation is the number of tuples it contains (the cardinality of r is 
3). A relational database is a set of relations (more elaborate definitions are possible, 
but this is sufficient for our purposes). 

I assume that every element of D (universal domain) denotes an object in the modelled 
world. ("Object in the world" is used here with a very loose meaning, that covers 
qualifications of employees, salaries, etc.) OBJS db is the set of all the world objects 
that are each denoted by a single element of D. (Some world objects may be represented 
in the database as collections of elements of D, e.g. as whole tuples. OBJS db contains 
only world objects that are denoted by single elements of D.) I also assume that a 
function /jj:J)h OBJS db is available, that maps each element v of D to the world 
object denoted by v. fo reflects the semantics assigned to the attribute values by the 
people who use the database. In practice, an element of D may denote different world 
objects when used as the value of different attributes. For example, 15700 may denote a 
salary when used as the value of salary, and a part of an engine when used as the value 
of an attribute part-no. Hence, the value of fo should also depend on the attribute 
where the element of D is used, i.e. it should be a function fjj-Dx T>a >— ► OBJS db . 
For simplicity, I overlook this detail. 

I also assume that fo is 1-1 (injective), i.e. that every element of D denotes a different 
world object. In practice, fo may not be 1-1: the database may use two different 
attribute values (e.g. dpt3 and sales-dpt) to refer to the same world object. The Top 
to TSQL2 translation could be formulated without assuming that fo is 1-1. This 
assumption, however, bypasses uninteresting details. By the definition of OBJS db , any 
element of OBJS db is a world object denoted by some element of D. That is, for every 
o G OBJS db , there is a v G D, such that /d(v) = o, i.e. fn is also surjective. Since ju 
is both 1-1 and surjective, the inverse mapping f^ 1 is a function, and f^ 1 is also 1-1 
and surjective. 

5.2.2 TSQL2's model of time 

Like Top, Tsql2 assumes that time is discrete, linear, and bounded. In effect, Tsql2 
models time as consisting of chronons. Chronons are the shortest representable units of 
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time, and correspond to Top's time-points .[] Depending on the Tsql2 implementation, 
a chronon may represent a nanosecond, a day, or a whole century. Let us call the 
(implementation-specific) set of chronons CHRONS . Although not stated explicitly, 



it is clear from the discussion in chapter 6 of [Snodgrass 95 1 that CHRONS ^ 0, that 
chronons are ordered by a binary precedence relation (let us call it < db ), and that 
(CHRONS , -< db ) has the properties of transitivity, irreflexivity, linearity, left and right 
boundedness, and discreteness (section [T^). 

I define periods over (CHRONS, ^ db ) in the same way as periods over (PTS, -<) (section 
3.3). A period over (CHRONS , -< db ) is a non-empty and convex set of chronons. An 



instantaneous period over (CHRONS , -< db ) is a set that contains a single chronon. 
PERIODS (chrons, ^ db ) an d INSTANTS / chrons ,< dh ) are ^ ne se ^ s °f au P er i°ds and all 



instantaneous periods over (CHRONS , -< db ) respectively. In section 5.8 , I set the point 
structure (PTS, -<) of Top's model to (CHRONS, -<<*). Hence, PERIODS {PT s^) and 
INSTANTS (pts, -<) become PERIODS {CHR0NS <db) and INSTANTS {CHR0NS ^ db) . As 
in chapter g, I write PERIODS and INSTANTS to refer to these sets, and PERIODS* 
to refer to PERIODS U {0}. 

A temporal element over (CHRONS , < db ) is a non-empty (but not necessarily convex) 
set of chronons. TELEMS (chrons ,^. db ) ( or simply TELEMS) is the set of all temporal 
elements over (CHRONS, < db ). Obviously, PERIODS C TELEMS. For every I £ 
TELEMS , mxlpers(l) is the set of the maximal periods of I, defined as follows: 

mxlpers(l) d = {p C I \ p e PERIODS and for no p € PERIODS is it true that 

p C I and p C p } 

The mxlpers symbol is overloaded. When I G TELEMS , mxlpers(l) is defined as 
above. When 5* is a set of periods, mxlpers(S) is defined as in section |3[ 



Tsql2 supports multiple granularities. These correspond to Top complete parti- 
tionings. A granularity can be thought of as a set of periods over (CHRONS ,< db ) 
(called granules), such that no two periods overlap, and the union of all the peri- 
ods is CHRONS. A lattice is used to capture relations between granularities (e.g. a 
year-granule contains twelve month-granules, etc; see chapter 19 of | Snodgrass 95| ). 



INSTANTS , also called the granularity of chronons, is the finest available granularity. 

1 TSQL2 distinguishes between valid-time chronons, transaction-time chronons, and bitemporal 
chronons (pai rs each comprising a vali d-tim e and a transaction-time chronon; see chapter 10 of 



Snodgrass 95]). As noted in section 1.2.4, transaction-time is ignored in this thesis. Hence, 



transaction-time and bitemporal chronons are not used, and "chronon" refers to valid-time chronons. 
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Tsql2 allows periods and temporal elements to be specified at any granularity. For 
example, one may specify that the first day of a period is 25/11/95, and the last day is 
28/11/95. If the granularity of chronons is finer than the granularity of days, the exact 
chronons within 25/11/95 and 28/11/95 where the period starts and ends are unknown. 
Similarly, if a temporal element is specified at a granularity coarser than INSTANTS , 
the exact chronon-boundaries of its maximal periods are unknown^] These are exam- 
ples of indeterminate temporal information (see chapter 18 of [ [Snodgrass 95 ]). Infor- 



mation of this kind is ignored in this thesis. I assume that all periods and temporal 
elements are specified at the granularity of chronons, and that we know exactly which 
chronons are or are not included in periods and temporal elements. Granularities other 
than INSTANTS will be used only to express durations (see below). 

Finally, Tsql2 uses the term interval to refer to a duration (see comments in section 



3.1). An interval is a number of consecutive granules of some particular granularity 



(e.g. two day-granules, five minute-granules). 
5.2.3 BCDM 



As noted in section 1.2.4, numerous temporal versions of the relational model have 



been proposed. Tsql2 is based on a version called Bcdm. Apart from the relations of 



the traditional relational model (section |5.2.1 ), which are called snapshot relations in 



Tsql2, Bcdm provides valid-time relations, transaction-time relations, and bitemporal 
relations. Transaction-time and bitemporal relations are not used in this thesis (see 
chapter 10 of | Snodgrass 95| 1). Valid-time relations are similar to snapshot relations, 



except that they have a special extra attribute (the implicit attribute) that shows when 
the information of each tuple was/is/will be true. 

A special domain Dt G T>d is assumed, whose elements denote the elements of 
TELEMS (temporal elements). For every v% G Dt, foivt) G TELEMS; and for 
every I G TELEMS , fp (I) G Dt- Dt is the domain of the implicit attribute. Since 
Dt £ T>t>, Dt Q D (D is the union of all the domains in T>t>)- The assumptions of 



section 5.2.1 about fn still hold: I assume that fjj is an injective and surjective func- 
tion from D (which now includes Dt) to OBJS db . Since the elements of Dt denote 



all the elements of TELEMS , Dt Q D, and OBJS db contains all the objects denoted 



To bypass this problem, in [ Androutsopoulos el al. 95a] periods and temporal elements are defined 



as sets of granules (of any granularity) rather than sets of chronons. 
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by elements of D, it must be the case that TELEMS C OBJS db . Then, the fact that 
PERIODS C TELEMS (section gjg ) implies that PERIODS C OBJS db . 

A valid-time relation r over a relation-schema i? = (Ai, A2, . . . , A„) is a subset of 
D(^i) x D(A 2 ) x • • • x D(A n ) x D T , where -D(vli), £>(A 2 ), • • • , D(A n ) are the domains 
of Ai, A 2 , ... , A n . A 2j ... , A n are the explicit attributes of r. I use the nota- 
tion (ui,U2> • • • 7 v n', v t) to refer to tuples of valid-time relations. If r is as above and 
(vx,v 2 ,. . . ,v n ;v t ) G r, then v\ G £>(Ai), u 2 G D(A 2 ), • • • , v n G D(A n ), and v t G jDt- 
t>i, t> 2 , ... , v n are the values of the explicit attributes, while vt is the value of the 
implicit attribute and the time-stamp of the tuple. In snapshot relations, all attributes 
count as explicit. In the rest of this thesis, "valid-time relation" on its own refers to a 
valid-time relation over any relation-schema. 

Tsql2 actually distinguishes between state valid-time relations and event valid-time 



relations (see chapter 16 of [ Snodgrass 95 |). These are intended to model situations 
that have duration or are instantaneous respectively. This distinction seems particu- 
larly interesting, because it appears to capture some facets of the aspectual taxonomy 
of chapter ^. Unfortunately, it is also one of the most unclear and problematically 
defined features of Tsql2. The time-stamps of state and event valid-time relations 
are supposed to denote "temporal elements" and "instant sets" respectively. "Tem- 
poral elements" are said to be unions of periods, while "instant sets" simply sets of 
chronons (see p. 314 of | Snodgrass 95| 1). This distinction between "temporal elements" 
and "instant sets" is problematic. A union of periods is a union of convex sets of 
chronons, i.e. simply a set of chronons. (The union of two convex sets of chronons 
is not necessarily convex.) Hence, one cannot distinguish between unions of periods 
and sets of chronons (see also section 2 of |Androutsopoulos et al. 95a| ). In section 3.3 
of [Androutsopoulos et al. 95a] we also argue that Tsql2 does not allow specifying 
whether a computed valid-time relation should be state or event. Given these prob- 
lems, I chose to drop the distinction between state and event valid-time relations. I 
assume that the time-stamps of all valid-time relations denote temporal elements, with 
temporal elements being sets of chronons. 

For example, assuming that the domains of employee and salary are as in section 



5.2.1, valsalaries below is a valid-time relation over (employee, salary) , shown in its 
tabular form (the double vertical line separates the explicit attributes from the implicit 



one). According to chapter 10 of Snodgrass 95 1, the elements of Dt are non-atomic. 
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Each element vt of Dt is in turn a set, whose elements denote the chronons that belong 
to the temporal element represented by vt- 



valsalaries 


employee 


salary 




J.Adams 
J.Adams 
J.Adams 
T.Smith 
T.Smith 


17000 
18000 
18500 
19000 
21000 


{c[, C2, C3, . . . , c i ni } 
{Ci, c 2 , c 3 , . . . , c„ 2 } 
{c 1 , c 2 , c 3 , . . . , c„ 3 } 
{c 1; c 2 , c 3 , . . . , c„ 4 } 
{c 1; c 2 , c 3 , . . . , c^ 5 } 



For example, c\, qjj, C3, . . . , in the first tuple for J.Adams above represent all the 
chronons where the salary of J.Adams was/is/will be 17000. {c\, c\,c\, . . . , c^ x } is an 
element of Dt- For simplicity, when depicting valid-time relations I often show (in 
an informal manner) the temporal elements denoted by the time-stamps rather the 
time-stamps themselves, valsalaries would be shown as below, meaning that the 
time-stamp of the first tuple represents a temporal element of two maximal periods, 
1/1/92 to 12/6/92 and 8/5/94 to 30/10/94. (I assume here that chronons correspond 
to days, now refers to the current chronon.) 

(5.1) 



valsalaries 


employee 


salary 




J.Adams 
J.Adams 
T.Smith 


17000 
18000 
21000 


[1/1/92, 12/6/92] U [8/5/94, 30/10/94] 
[13/6/92, 7/5/94] U [31/10/94, now] 
[15/6/92, now] 



Two tuples (v\, . . . , i4; v}) and (vf , . . . , v%) are value-equivalent iff if v\ 



A valid-time relation is coalesced iff it contains no value-equivalent tuples. 



Bcdm requires all valid-time relations to be coalesced (see p. 188 of | Snodgrass 95| ). 
For example, ( |5.2[) is not allowed (its first and third tuples are value-equivalent). In 
this thesis, this Bcdm restriction is dropped, and ( ^.2[) is allowed. 

(5.2) 



employee 


salary 




J.Adams 
J.Adams 
J.Adams 
J.Adams 
T.Smith 


17000 
18000 
17000 
18000 
21000 


[1/1/92, 12/6/92] 
[13/6/92, 7/5/94] 
[8/5/94, 30/10/94] 
[31/10/94, now] 
[15/6/92, now] 



By the definition of Dt, the elements of Dt denote all the elements of TELEMS 

(temporal elements). Since PERIODS C TELEMS ', some of the elements of Dt denote 

periods. Dp is the subset of all elements of Dt that denote periods]^ I also assume that 
3 |Snodgrass 95[ seems to adopt a different approach, where Dp n Dt = 0. 
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there is a special value v e G D, that is used to denote the empty set (of chronons). For 
example, a Tsql2 expression that computes the intersection of two non-overlapping 
periods evaluates to v e ^ I use Dp to refer to Dp U {v e }. 

The following notation will prove useful: 

• VRELp is the set of all valid-time relations whose time-stamps are all elements 
of Dp (all the time-stamps denote periods). 

• NVRELp is the set of all the (intuitively, "normalised") relations r G VRELp 
with the following property: if (v\, . . . , v n ; v}) G r, (v%, . . . , v n ; v|) G r, and 
fo(vt) U foiyl ) G PERIODS , then i£ = ■ This definition ensures that in any 
r G NVRELp, there is no pair of different value-equivalent tuples whose time- 
stamps vl and Vf denote overlapping or adjacent periods (because if the periods 
of vj and overlap or they are adjacent, their union is also a period, and then 
it must be true that v} = u|, i.e. the value-equivalent tuples are not different). 

• SREL is the set of all snapshot relations. 

• For every n G {1, 2, 3, . . . }, VRELp(n) contains all the relations of VRELp that 
have n explicit attributes. Similarly, NVRELp(n) and SREL(n) contain all the 
relations of NVRELp and SREL respectively that have n explicit attributes. 

To simplify the proofs in the rest of this chapter, I include the empty relation in all 
VRELp(n), NVREL P (n), SREL(n), for n = 1, 2, 3, ... . 



5.2.4 The TSQL2 language 



This section is an introduction to the features of Tsql2 that are used in this thesis. 



SELECT statements 



As noted in section 1.2.4 , Tsql2 is an extension of Sql-92. Roughly speaking, Sql-92 
queries (e.g. |5l^) consist of three clauses: a SELECT, a FROM, and a WHERE clause. (The 
term SELECT statement will be used to refer to the whole of a Sql-92 or Tsql2 query.) 



4 Table 8.3 of |Snodgrass 9st implies that v e is the special "null" value. In SQL, null has several roles. 
Here, I assume that there is a special value v E whose only role is to denote the empty set. 
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(5.3) SELECT DISTINCT sal. salary 

FROM salaries AS sal, managers AS mgr 

WHERE mgr. manager = 'J. Adams' AND sal. employee = mgr. managed 



Assuming that salaries and managers are as below, ( p.3|) generates the third relation 
below. 



salaries 


employee 


salary 


J.Adams 


17000 


T.Smith 


18000 


G. Papas 


14500 


B. Hunter 


17000 


K.Kofen 


16000 



managers 


manager 


managed 


J.Adams 
J.Adams 
J.Adams 
T.Smith 
T.Smith 


G. Papas 
B. Hunter 
J.Adams 
K.Kofen 
T.Smith 



(result) 



salary 



17000 
14500 



(|5.3| ) generates a snapshot one-attribute relation that contains the salaries of all em- 
ployees managed by J.Adams. The FROM clause of ( |5.3j ) shows that the query operates 
on the salaries and managers relations, sal and mgr are correlation names. They 
can be thought of as tuple- variables ranging over the tuples of salaries and managers 
respectively. The (optional) WHERE clause imposes restrictions on the possible combina- 
tions of tuple-values of sal and mgr. In every combination, the manager value of mgr 
must be J.Adams, and the managed value of mgr must be the same as the employee 
value of sal. For example, {J.Adams, G. Papas) and {G. Papas, 14500) is an accept- 
able combination of mgr and sal values respectively, while {J.Adams, G. Papas) and 
{B. Hunter, 17000) is not. 

In Sql-92 (and Tsql2), correlation names are optional, and relation names can be 



used to refer to attribute values. In (5.3), for example, one could omit AS mgr, and 
replace mgr. manager and mgr. managed by managers .manager and managers .managed. To 



simplify the definitions of section 5.4 below, I treat correlation names as mandatory, 



and I do not allow relation names to be used to refer to attribute values. 



The SELECT clause specifies the contents of the resulting relation. In (|5.3| ), it specifies 
that the resulting relation should have only one attribute, salary, and that for each 
acceptable combination of sal and mgr values, the corresponding tuple of the resulting 
relation should contain the salary value of sal's tuple. The DISTINCT in ( |5.3[) causes 
duplicates of tuples to be removed from the resulting relation. Without the DISTINCT 
duplicates are not removed. The result of (|5.3f) would contain two identical tuples 
(17000), deriving from the tuples for J.Adams and B. Hunter in salaries. This is 
against the set-theoretic definition of relations of sections 5.2.1 and 5.2.3| (relations 
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were defined to be sets of tuples, and hence cannot contain duplicates.) To ensure that 
relations contain no duplicates, in this thesis SELECT statements always have a DISTINCT 
in their SELECT clauses. 

Tsql2 allows SELECT statements to operate on valid-time relations as well. A SNAPSHOT 
keyword in the SELECT statement indicates that the resulting relation is snapshot. 
When the resulting relation is valid-time, an additional VALID clause is present. In 
the latter case, the SELECT clause specifies the values of the explicit attributes of the 
resulting relation, while the VALID clause specifies the time-stamps of the resulting 



tuples. Assuming, for example, that vaLsalaries is as in (|5.lD , ( |5.4[) returns (^J 
(5.4) 



SELECT DISTINCT sal . employee , sal. salary 
VALID PERIOD (BEGIN (VALID (sal) ) , END (VALID (sal) ) ) 
FROM val_salaries AS sal 



(5.5) 



employee 


salary 




J.Adams 
J.Adams 
T.Smith 


17000 
18000 
21000 


[1/1/92, 30/10/94] 
[13/6/92, now] 
[15/6/92, now] 



The VALID keyword is used both to start a VALID-clause (a clause that specifies the 
time-stamps of the resulting relation) and to refer to the time-stamp of the tuple- value 



of a correlation name. In (5.4), VALID (sal) refers to the time-stamp of sal's tuple (i.e. 
to the time-stamp of a tuple from valsalaries). BEGIN (VALID (sal) ) refers to the first 
chronon of the temporal element represented by that time-stamp, and END (VALID (sal) ) 
to the last chronon of that temporal element. f\ The PERIOD function generates a period 
that starts at the chronon of its argument, and ends at the chronon of its second 
argument. Hence, each time-stamp of ( |5.5| ) represents a period that starts/ends at 
the earliest/latest chronon of the temporal element of the corresponding time-stamp 
of valsalaries. 



Literals 



TSQL2 provides PERIOD literals, INTERVAL literals, and TIMESTAMP literals (the use of 
"TIMESTAMP" in this case is unfortunate; these literals specify time-points, not time- 
stamps of valid-time relations, which denote temporal-elements). For example, PERIOD 



5 Section 30.5 of Jgnodgrass 95| ] allows BEGIN and END to be used only with periods. I see no reason 
for this limitation. I allow BEGIN and END to be used with any temporal element. 
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' [March 3, 1995 - March 20, 1995] ' is a literal that specifies a period at the gran- 
ularity of days. If chronons are finer than days, the assumption in Tsql2 is that 
the exact chronons within March 3 and March 20 where the period starts and ends 
are unknown (section 5.2.2] ). In this thesis, PERIOD literals that refer to granularities 
other than that of chronons are abbreviations for literals that refer to the granularity 
of chronons. The denoted period contains all the chronons that fall within the gran- 
ules specified by the literal. For example, if chronons correspond to minutes, PERIOD 
' [March 3, 1995 - March 20, 1995] ' is an abbreviation for PERIOD ' [00:00 March 3, 
1995 - 23:59 March 20, 1995]'. 

Tsql2 supports multiple calendars (e.g. Gregorian, Julian, lunar calendar; see chapter 
7 of fSnodgrass 95[| ). The strings that can appear between the quotes of PERIOD lit- 
erals (e.g. '[March 3, 1995 - March 20, 1995]', '(3/4/95 - 20/4/95] ') depend on the 
available calendars and the selected formatting options (see chapter 7 of [ Snodgrass 95f| ). 
The convention seems to be that the boundaries are separated by a dash, and that the 
first and last characters of the quoted string are square or round brackets, depend- 
ing on whether the boundaries are to be included or not. I also assume that PERIOD 
'today' can be used (provided that chronons are at least as fine as days) to refer to the 
period that covers all the chronons of the present day. (There are other Tsql2 expres- 
sions that can be used to refer to the current day, but I would have to discuss Tsql2 
granularity-conversion commands to explain these. Assuming that PERIOD 'today' is 
available allows me to avoid these commands.) 

TIMESTAMP literals specify chronons. Only the following special TIMESTAMP literals are 
used in this thesis: TIMESTAMP 'beginning', TIMESTAMP 'forever', TIMESTAMP 'now'. 
These refer to the beginning of time, the end of time, and the present chronon. 

An example of an INTERVAL literal is INTERVAL ' 5 ' DAY, which specifies a duration of five 
consecutive day-granules. The available granularities depend on the calendars that are 
active. The granularities of years, months, days, hours, minutes, and seconds are sup- 
ported by default. Intervals can also be used to shift periods or chronons towards the 
past or the future. For example, PERIOD ' [1991 - 1995] ' + INTERVAL '1' YEAR is the 
same as PERIOD '[1992 - 1996]'. If chronons correspond to minutes, PERIOD (TIMESTAMP 
'beginning', TIMESTAMP 'now' - INTERVAL '1' MINUTE) specifies the period that cov- 
ers all the chronons from the beginning of time up to (but not including) the current 
chronon. 
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Other TSQL2 functions and predicates 

The INTERSECT function computes the intersection of two sets of chronons.[] For ex- 
ample, INTERSECT (PERIOD '[May 1, 1995 - May 10, 1995]', PERIOD '[May 3, 1995 - 
May 15, 1995]') is the same as PERIOD '[May 3, 1995 - May 10, 1995]'. If the in- 



tersection is the empty set, INTERSECT returns v e (section |5.2.3 ) 



The CONTAINS predicate checks if a chronon belongs to a set of chronons. For example, 
if valsalaries is as in ( |5.l[) , ( |5.6[) generates a snapshot relation showing the current 
salary of each employee. CONTAINS can also be used to check if a set of chronons is a 
subset of another set of chronons. [| 

(5.6) SELECT DISTINCT SNAPSHOT sal . employee , sal. salary 
FROM val_salaries AS sal 

WHERE VALID(sal) CONTAINS TIMESTAMP 'now' 

The PRECEDES predicate checks if a chronon or set of chronons strictly precedes an- 
other chronon or set of chronons. Section 8.3.6 of | Snodgrass 95(1 specifies the se- 



mantics of PRECEDES only in cases where its arguments are chronons or periods. I 
assume that expr\ PRECEDES expr2 is true, iff the chronon of expr\ (if expr\ specifies 
a single chronon) or all the chronons of expri (if expr\ specifies a set of chronons) 
strictly precede the chronon of expr<i (if expr<i specifies a single chronon) or all the 
chronons of expr2 (if expr2 specifies a set of chronons). For example, PERIOD ' [1/6/95 
- 21/6/95] ' PRECEDES PERIOD ' [24/6/95 - 30/6/95] ' is true, but PERIOD ' [1/6/95 - 
21/6/95] ' PRECEDES PERIOD ' [19/6/95 - 30/6/95] ' is not. 

Embedded SELECT statements 

Tsql2 (and Sql-92) allow embedded SELECT statements to be used in the FROM clause, 
in the same way that relation names are used (e.g. flsT?]) ). 



Section 8.3.3 of fcnod grass 95| requires both arguments of INTERSECT to denote periods, but section 



30.14 allows the arguments of INTERSECT to denote temporal elements. I follow the latter. I also 
allow the arguments of INTERSECT to denote the empty set. 



7 Table 8.7 in section 8.3.6 and additional syntax rule 3 in section 32.4 of [Snodgrass 95] allow the 



argume nts of CONTAIN S to denote periods but not generally temporal elements. Table 32.1 in section 



32.4 of Snodgrass 95 1, however, allows the arguments of CONTAINS to denote temporal elements. 
I follow the latter. I also allow the arguments of CONTAINS to denote the empty set. The same 
comments apply in the case of PRECEDES. 
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(5.7) SELECT DISTINCT SNAPSHOT sal2. salary 
FROM (SELECT DISTINCT sail. salary 
VALID VALID(sall) 
FROM val_salaries AS sail 
) AS sal2 
WHERE sal2. salary > 17500 

Assuming that valsalaries is as in (|5.1| ), the embedded SELECT statement above simply 
drops the employee attribute of vaLsalaries, generating (|5.8|) . sal2 ranges over the 
tuples of (|5.8|) . ( |5.7l ) generates a relation that is the same as (|5.8|) , except that tuples 
whose salary values are not greater than 17500 are dropped. 



salary 




17000 
18000 
21000 


[1/1/92, 12/6/92] U [8/5/94, 30/10/94] 
[13/6/92, 7/5/94] U [31/10/94, now] 
[15/6/92, now] 



Partitioning units 



In Tsql2, relation names and embedded SELECT statements in the FROM clause can 
be followed by partitioning units.^ Tsql2 currently provides two partitioning units: 
(PERIOD) and (INSTANT) (see section 30.3 and chapter 12 of [ pnodgrass~9~5| ]). (INSTANT) 
is not used in this thesis. Previous Tsql2 versions (e.g. the September 1994 version of 
chapter 12 of [pnodgrass 95 1) provided an additional (ELEMENT). For reasons explained 
below, (ELEMENT) is still used in this thesis. 



(ELEMENT) merges value-equivalent tuples.^ For example, if rell is the relation of ( |5,9| 
( 5. 10| ) generates the coalesced relation of ( |5 .11 ). 



(5.9) 



rell 


employee 


salary 




J.Adams 


17000 


[1986, 1988] 


J.Adams 


17000 


[1987, 1990] 


J.Adams 


17000 


[1992, 1994] 


G. Papas 


14500 


[1988, 1990] 


G. Papas 


14500 


[1990, 1992] 



followed by partitioning units in FROM clauses. |Snodgrass et al. 94b (queries Q.l.2.2, Q.l.2.5 



Q. 1.7.6), howe ver, shows SELECT sta tements embedded in FROM clauses and followed by partitioning 



units. I follow [Snodgrass et al. 94b]. 
9 The semantics of (ELEMENT) was never clear. The discussion here reflects my understanding of the 
September 1994 TSQL2 documentation, and the semantics that is assigned to (ELEMENT) in this 
thesis. 
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(5.10) 



(5.11) 



SELECT DISTINCT rl . employee , rl. salary 

VALID VALID (rl) 

FROM rell (ELEMENT) AS rl 



employee 


salary 




J.Adams 
G. Papas 


17000 
14500 


[1986, 1990] U [1992, 1994] 
[1988, 1992] 



The effect of (ELEMENT) on a valid-time relation r is captured by the coalesce function: 

coalesce(r) = {{v±, . . . , v n ; vt) \ (v±, . . . , v n ; v' t ) G r and 

f D (v t ) = |J f D (v't)} 
(vi,...,v n ;v"}& 

(ELEMENT) has no effect on already coalesced valid-time relations. Hence, in the Bcdm 



version of Snodgrass 95 1, where all valid-time relations are coalesced, (ELEMENT) is 
redundant (and this is probably why it was dropped). In this thesis, valid-time relations 
are not necessarily coalesced (section |5.2.3| ), and (ELEMENT) plays an important role. 

(PERIOD) intuitively breaks each tuple of a valid-time relation into value-equivalent 
tuples, each corresponding to a maximal period of the temporal element of the original 
time-stamp. Assuming, for example, that rel2 is the relation of ( 5.11| ), ( |5.12j ) generates 



(5.12) 



(5.13) 



SELECT DISTINCT r2 . employee , r2. salary 

VALID VALID (r2) 

FROM rel2 (PERIOD) AS r2 



employee 


salary 




J.Adams 
J.Adams 
G. Papas 


17000 
17000 
14500 


[1986, 1990] 
[1992, 1994] 
[1988, 1992] 



As the example shows, (PERIOD) may generate non-coalesced relations. This is myste- 



rious in the Bcdm version of [ Snodgrass 95 ], where non-coalesced valid-time relations 
are not allowed. The assumption seems to be that although non-coalesced valid-time 
relations are not allowed, during the execution of SELECT statements temporary non- 
coalesced valid-time relations may be generated. Any resulting valid-time relations, 
however, are coalesced automatically at the end of the statement's execution. ( |5.13 ) 
would be coalesced automatically at the end of the execution of ( pM2| ) (cancelling, in 
this particular example, the effect of (PERIOD)). In this thesis, no automatic coalescing 
takes place, and the result of ( |5.12j ) is ( 5.13[ ). 
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To preserve the spirit of (PERIOD) in the Bcdm version of this thesis where valid-time 
relations are not necessarily coalesced, I assume that (PERIOD) operates on a coalesced 



copy of the original relation. Intuitively, (PERIOD) first causes (|5.9|) to become (5.11), 



and then generates (5.13). The effect of (PERIOD) on a valid-time relation r is captured 



by the pcoalesce function: 

pcoalesce(r) d = {(v\, . . . , v n ;vt) \ (v\, . . . , v n ; v' t ) E coalesce(r) and 
f D {v t ) 6 mxlpers(f D (v' t ))} 

5.3 Modifications of TSQL2 

This thesis adopts some modifications of Tsql2. Some of the modifications were 



mentioned in section 5.2. The main of those were: 



• The requirement that all valid-time relations must be coalesced was dropped. 

• The distinction between state and event valid-time relations was abandoned. 

• (ELEMENT) was re-introduced. 

• The semantics of (PERIOD) was enhanced, to reflect the fact that in this thesis 
valid-time relations are not necessarily coalesced. 

• All periods and temporal elements are specified at the granularity of chronons. 
Literals referring to other granularities are used as abbreviations for literals that 
refer to the granularity of chronons. 

This section describes the remaining TSQL2 modifications of this thesis. 

5.3.1 Referring to attributes by number 



In Tsql2 (and Sql-92) explicit attributes are referred to by their names. In d5lj ), 
for example, sal. salary refers to the salary attribute of vaLsalaries. 



(5.14) 



SELECT DISTINCT sal . salary 

VALID VALID (sal) 

FROM val_salaries AS sal 
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In the Tsql2 version of this thesis, explicit attributes are referred to by number, 
with numbers corresponding to the order in which the attributes appear in the re- 
lation schema (section |5.2.1 ). For example, if the relation schema of valsalaries is 
{employee, salary) , employee is the first explicit attribute and salary the second one. 
( 5.15| ) would be used instead of ( 5.14 ). To refer to the implicit attribute, one still uses 
VALID (e.g. VALID(sal)). 

(5.15) SELECT DISTINCT sal. 2 
VALID VALID (sal) 
FROM salaries AS sal 

Referring to explicit attributes by number simplifies the Top to Tsql2 translation, 
because this way there is no need to keep track of the attribute names of the various 
relations. 

5.3.2 (SUBPERIOD) and (NOSUBPERIOD) 



Two new partitioning units, (SUBPERIOD) and (NOSUBPERIOD), were introduced for the 
purposes of this thesis. (SUBPERIOD) is designed to be used with relations from VRELp 
(section [5.2.3 ). The effect of (SUBPERIOD) on a relation r is captured by the subperiod 
function: 

subperiod(r) d = {(vi, . . .,v n ;v t ) \ (v 1} ...,v n ;v' t ) e r and /c^t) E /z?K)} 

For each tuple {vi, . . . , v n ; v' t ) € r, the resulting relation contains many value-equivalent 
tuples of the form (vi, . . . ,v n ;v t ), one for each period fnivt) that is a subperiod of 
fr)(v' t ). Assuming, for example, that chronons correspond to years, and that rel is the 
relation of ( |5. 16 ), ( 5.17| ) returns the relation of ( |5.18 ). 



(5.16) 



J.Adams 


17000 


[1992, 


1993] 


G. Papas 


14500 


[1988, 


1990] 


G. Papas 


14500 


[1990, 


1991] 



(5.17) 



SELECT DISTINCT r.l, r.2 

VALID VALID (r) 

FROM rel (SUBPERIOD) AS r 
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J.Adams 


17000 


[1992, 


1993] 


J.Adams 


17000 


[1992, 


1992] 


J.Adams 


17000 


[1993, 


1993] 


G. Papas 


14500 


[1988, 


1990] 


G.rapas 


14500 


ri no o 
[19»o, 


1 no ol 
1988J 


G. Papas 


14500 


[1988, 


1989] 


G. Papas 


14500 


[1989, 


1989] 


G. Papas 


14500 


[1989, 


1990] 


G. Papas 


14500 


[1990, 


1990] 


G. Papas 


14500 


[1990, 


1991] 


G. Papas 


14500 


[1991, 


1991] 



The first three tuples of (5.18) correspond to the first tuple of (5.16). The following 
six tuples correspond to the first tuple of G. Papas in ( |5.16| ), The remaining tuples of 
( 5.18| ) derive from the second tuple of G. Papas in ( 5.16D (the tuple for the subperiod 
[1990, 1990] has already been included in ( [5.18|) ). Notice that (SUBPERIOD) does not 
coalesce the original relation before generating the result (this is why there is no tuple 
for G.Papas time-stamped by [1988, 1991] in ( pTl8|) ). 



Obviously, the cardinality of the resulting relations can be very large (especially if 
chronons are very fine, e.g. seconds). The cardinality, however, is never infinite (as- 
suming that the cardinality of the original relation is finite): given that time is discrete, 
linear, and bounded, any period p is a finite set of chronons, and there is at most a 
finite number of periods (convex sets of chronons) that are subperiods (subsets) of p; 
hence, for any tuple in the original relation whose time-stamp represents a period p, 
there will be at most a finite number of tuples in the resulting relation whose time- 
stamps represent subperiods of p. It remains, of course, to be examined if (SUBPERIOD) 
can be supported efficiently in Dbmss. It is obviously very inefficient to store (or print) 
individually all the tuples of the resulting relation. A more space-efficient encoding of 
the resulting relation is needed. I have not explored this issue. 

Roughly speaking, (SUBPERIOD) is needed because during the TOP to TSQL2 transla- 
tion every Top formula is mapped to a valid-time relation whose time-stamps denote 
the event-time periods where the formula is true. Some (but not all) formulae are 
homogeneous (section |3.6| ). For these formulae we need to ensure that if the valid- 
time relation contains a tuple for an event-time et, it also contains tuples for all the 



subperiods of et. This will become clearer in section 5.11. 
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(N0SUBPERI0D) is roughly speaking used when the effect of (SUBPERIDD) needs to be can- 
celled. (N0SUBPERI0D) is designed to be used with relations from VRELp. It eliminates 
any tuple (v%, . . . , v n ; Vt), for which there is a value-equivalent tuple (vi, . . . ,v n ;v' t ), 
such that fo{vt) C fnWt). The effect of (NDSUBPERIDD) on a valid-time relation r is 
captured by the nosubperiod function: 

nosubperiod(r) d = . . . , v n ; Vt) 6 r | there is no (v±, . . . , v n ; v' t ) G r 

such that f D {v t ) C /dK)} 
Applying (NDSUBPERIDD) to Q5.18| ) generates ( |5.19| ). 



(5.19) 



J.Adams 


17000 


[1992, 1993] 


G. Papas 


14500 


[1988, 1990] 


G. Papas 


14500 


[1990, 1991] 



Although (SUBPERI0D) and (NOSUBPERIOD) are designed to be used (and in practice will 
always be used) with relations from VRELp, I allow (SUBPERIOD) and (NOSUBPERIOD) to 
be used with any valid-time relation. In the proofs of appendix this saves me having 
to prove that the original relation is an element of VRELp whenever (SUBPERIOD) and 
(NOSUBPERIOD) are used. 



5.3.3 Calendric relations 



As mentioned in section |5.2.4| , Tsql2 supports multiple calendars. Roughly speak- 
ing, a Tsql2 calendar describes a system that people use to measure time (Gregorian 
calendar, Julian calendar, etc.). Tsql2 calendars also specify the meanings of strings 
within the quotes of temporal literals, and the available granularities. According to 



section 3.2 of [ Snodgrass 95 ], Tsql2 calendars are defined by the database administra- 
tor, the Dbms vendor, or third parties. In this thesis, I assume that TSQL2 calendars 
can also provide calendric relations. Calendric relations behave like ordinary relations 
in the database, except that they are defined by the creator of the Tsql2 calendar, 
and cannot be updated. 

The exact purpose and contents of each calendric relation are left to the calendar 
creator. I assume, however, that a calendric relation provides information about the 
time-measuring system of the corresponding Tsql2 calendar .q] The Gregorian Tsql2 

10 Future work could establish a more systematic link between calendric relations and TSQL2 calendars. 
For example, calendric relations could be required to reflect (as a minim um) the lattice that shows 



how the granularities of the calendar relate to each other (section 5.2.2 
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calendar could, for example, provide the calendric valid-time relation gregorian below. 
(I assume here that chronons are finer than minutes.) 



gregorian 


year 


month 


dnum 


dname 


hour 


minute 




1994 


Sept 


4 


Sun 


00 


00 


{Cni ) ' ■ 


■ ) ^H2 } 


1994 


Sept 


4 


Sun 


00 


01 


{c n3 , . . 


■ j c n 4 } 


1995 


Dec 


5 


Tue 


21 


35 


{Cn 5 , ■ ■ 


■ > c n 6 } 



The relation above means that the first minute (00:00) of September 4th 1994 (which 
was a Sunday) covers exactly the period, that starts at trie chronon c ni and ends at 
the chronon c n2 . Similarly, the period that starts at c n3 and ends at c n4 is the sec- 
ond minute (00:01) of September 4th 1994. Of course, the cardinality of gregorian 
is very large, though not infinite (time in Tsql2 is bounded, and hence there is at 
most a finite number of minute-granules). It is important, however, to realise that 
although gregorian behaves like a normal relation in the database, it does not need to 
be physically present in the database. Its tuples could be computed dynamically, when- 
ever they are needed, using some algorithm specified by the Tsql2 calendar. Other 
calendric relations may list the periods that correspond to seasons (spring-periods, 
summer-periods, etc.), special days (e.g. Easter days), etc. 

Calendric relations like gregorian can be used to construct relations that represent 



the periods of partitionings. ( 5.20 ), for example, constructs a one-attribute snapshot 
relation, that contains all the time-stamps of gregorian that correspond to 21:36- 
minutes. The resulting relation represents all the periods of the partitioning of 21:36- 
minutes. 

(5.20) SELECT DISTINCT SNAPSHOT VALID (greg) 
FROM gregorian AS greg 
WHERE greg. 5 =21 AND greg. 6 = 36 



Similarly, (ggD generates a one-attribute snapshot relation that represents the periods 
of the partitioning of Sunday-periods. The embedded SELECT statement generates a 
valid-time relation of one explicit attribute (whose value is Sun in all tuples). The 
time-stamps of this relation are all the time-stamps of gregorian that correspond to 
Sundays (there are many tuples for each Sunday). The (PERIOD) coalesces tuples that 
correspond to the same Sunday, leading to a single period-denoting time-stamp for 
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each Sunday. These time-stamps become the attribute values of the relation generated 
by the overall fljTgj] ). 

(5.21) SELECT DISTINCT SNAPSHOT VALID (greg2) 
FROM (SELECT DISTINCT gregl.4 
VALID VALID (gregl) 
FROM gregorian AS gregl 
WHERE gregl.4 = 'Sun' 
) (PERIOD) AS greg2 



In [ Androutsopoulos et al. 95a ] we argue that calendric relations constitute a generally 



useful addition to Tsql2, and that unless appropriate calendric relations are available, 
it is not possible to formulate Tsql2 queries for questions involving existential or 
universal quantification or counts over day-names, month names, season-names, etc. 
(e.g. (HI - O). 



(5.22) Which technicians were at some site on a Sunday? 

(5.23) Which technician was at Glasgow Central on every Monday in 1994? 

(5.24) On how many Sundays was J.Adams at Glasgow Central in 1994? 

5.3.4 The INTERVAL function 

Tsql2 provides a function INTERVAL that accepts a period-denoting expression as its 
argument, and returns an interval reflecting the duration of the period. The assump- 
tion seems to be that the resulting interval is specified at whatever granularity the pe- 
riod is specified. For example, INTERVAL (PERIOD ' [1/12/95 - 3/12/95] ') is the same as 
INTERVAL ' 3 ' DAY. In this thesis, all periods are specified at the granularity of chronons, 
and if chronons correspond to minutes, PERIOD ' [1/12/95 - 3/12/95] ' is an abbrevia- 
tion for PERIOD '[00:00 1/12/95 - 23:59 3/12/95]' (sections |B.2.2| and ^4]) . Hence, 



the results of INTERVAL are always specified at the granularity of chronons. When 
translating from Top to Tsql2, however, there are cases where we want the results of 
INTERVAL to be specified at other granularities. 

This could be achieved by converting the results of INTERVAL to the desired gran- 
ularities. The TSQL2 mechanisms for converting intervals from one granularity to 
another, however, are very obscure (see section 19.4.6 of [Snodgrass 95]). To avoid 



these mechanisms, I introduce an additional version of the INTERVAL function. If expr\ 
is a Tsql2 expression that specifies a period p, and expri is the Tsql2 name (e.g. 
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DAY, MONTH) of a granularity G, then INTERVAL (expr\, expr 2 ) specifies an interval of 
n granules (periods) of G, where n is as follows. If there are k consecutive granules 
91,52,53, ■■■ ,9k in G such that g\ U g 2 U (/3 U . . . U g k = p, then n = k. Other- 
wise, n = 0. For example, INTERVAL (PERIOD '[May 5, 1995 - May 6, 1995]', DAY) 
is the same as INTERVAL '2' DAY, because the period covers exactly 2 consecutive 
day-granules. Similarly, INTERVAL (PERIOD '[May 1, 1995 - June 30, 1995]', MONTH) 
is the same as INTERVAL '2' MONTH, because the period covers exactly two consecu- 
tive month-granules. In contrast, INTERVAL (PERIOD '[May 1, 1995 - June 15, 1995]', 
MONTH) is the same as INTERVAL '0' MONTH (zero duration), because there is no union 
of consecutive month-granules that covers exactly the period of PERIOD ' [May 1, 1995 
- June 15, 1995] ' . 



5.3.5 Correlation names used in the same FROM clause where they 
are defined 



The syntax of Tsql2 (and Sql-92) does not allow a correlation name to be used in a 
SELECT statement that is embedded in the same FROM clause that defines the correlation 
name. For example, ( 5.25j ) is not allowed, because the embedded SELECT statement 



uses rl, which is defined by the same FROM clause that contains the embedded SELECT 
statement. 

(5.25) SELECT . . . 

VALID VALID (rl) 
FROM rell AS rl, 
(SELECT . . . 
VALID VALID (r2) 
FROM rel2 AS r2 

WHERE VALID(rl) CONTAINS VALID(r2) 
) AS r3 
WHERE . . . 

By definition of a correlation name a, I mean the expression "AS a" that associates 
a with a relation. For example, in ( ^.25| ) the definition of rl is the "AS rl".|^| A 
correlation name a is defined by a FROM clause £, if £ contains the definition of a, and 
this definition is not within a SELECT statement which is embedded in £. For example, 
in (|5.25|) the r2 is defined by the "FROM rel2 AS r2" clause, not by the "FROM rell AS 

11 In SELECT statements that contain other embedded SELECT statements, multiple definitions of the 
same correlation name may be present (there are rules that determine the scope of each definition) . 
We do not need to worry about such cases, however, because the generated TSQL2 code of this 
chapter never contains multiple definitions of the same correlation name. 
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rl , ( . . . ) AS r3" clause. 

In this thesis, I allow a correlation name to be used in a SELECT statement that is 
embedded in the same FROM clause that defines the correlation name, provided that 
the definition of the correlation name precedes the embedded SELECT statement. (5.25) 



is acceptable, because the definition of rl precedes the embedded SELECT statement 
where rl is used. In contrast, ( 5.26| ) is not acceptable, because the definition of rl 



follows the embedded SELECT statement where rl is used. 

(5.26) SELECT ... 

VALID VALID (rl) 
FROM (SELECT . . . 

VALID VALID (r2) 

FROM rel2 AS r2 

WHERE VALID(rl) CONTAINS VALID(r2) 
) AS r3, 
rell AS rl 
WHERE . . . 



The intended semantics of statements like ( |5.25| ) should be easy to see: when evaluating 
the embedded SELECT statement, VALID (rl) should represent the time-stamp of a tuple 
from rell. The restriction that the definition of the correlation name must precede the 
embedded SELECT is imposed to make this modification easier to implement. 

The modification of this section is used in the Top to Tsql2 translation rules for 
At[(pi, <f>2\, Before[(j)\,<f>2[, and After[(j)i, (fa] (section |5.11 below and appendix |A|) . 



5.3.6 Equality checks and different domains 

Using the equality predicate (=) with expressions that refer to values from different 
domains often causes the TSQL2 (or Sql-92) interpreter to report an error. If, for 
example, the domain of the first explicit attribute of rel is the set of all integers, r.l 



in (5.27) stands for an integer. Tsql2 (and Sql-92) does not allow integers to be 



compared to strings (e.g. "J.Adams"). Consequently, ( 5.27 ) would be rejected, and an 
error message would be generated. 

(5.27) SELECT DISTINCT SNAPSHOT r.2 
FROM rel AS r 
WHERE r.l = 'J.Adams' 



In other cases (e.g. if a real number is compared to an integer), type conversions take 
place before the comparison. To by-pass uninteresting details, in this thesis I assume 
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that no type conversions occur when "=" is used. The equality predicate is satisfied 
iff both of its arguments refer to the same element of D (universal domain). No error 
occurs if the arguments refer to values from different domains. In the example of ( 5.27] ), 



r.l = 'J.Adams' is not satisfied, because r.l refers to an integer in D, 'J.Adams' to a 
string in D, and integers are different from strings. Consequently, in the Tsql2 version 
of this thesis ( 5.27| ) generates the empty relation (no errors occur). 



5.3.7 Other minor changes 

Tsql2 does not allow partitioning units to follow SELECT statements that are not em- 
bedded into other SELECT statements. For example, ( |5.28| ) on its own is not acceptable. 

(5.28) (SELECT DISTINCT rl.l, rl.2 

VALID VALID (rl) 
FROM rel AS rl 
) (PERIOD) 

SELECT statements like (|5.28j ) can be easily made acceptable by embedding them into 
another SELECT statement (e.g. (|5.29| )). 

(5.29) SELECT DISTINCT r2.1, r2.2 
VALID VALID (r2) 

FROM (SELECT DISTINCT rl.l, rl.2 
VALID VALID (rl) 
FROM rel AS rl 
) (PERIOD) AS r2 



For simplicity, I allow stand-alone statements like (|5.28[) . I assume that (|5.28[) generates 
the same relation as (|5.29j ). I also allow stand-alone SELECT statements enclosed in 
brackets (e.g. ( 5.3C| )). I assume that the enclosing brackets are simply ignored. 



(5.30) (SELECT DISTINCT rl.l, rl.2 
VALID VALID (rl) 
FROM rel AS rl 

) 



5.4 Additional TSQL2 terminology 



This section defines some additional terminology, that is used to formulate and prove 
the correctness of the Top to Tsql2 translation. 
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Column reference: A column reference is an expression of the form a.i or VALID (a) , 
where a is a correlation name and i S {1,2,3, . . . } (e.g. sal. 2, VALID(sal)). 

Binding context: A SELECT statement £ is a binding context for a column reference 
a.i or VALID (a) iff: 

• the column reference is part of S, 

is defined (in the sense of section 5.3.5| ) by the topmost FROM clause of X, and 



a 



• the column reference is not in the topmost FROM clause of S; or it is in the topmost 
FROM clause of S, but the definition of a precedes the column reference. 

By topmost FROM clause o/EI mean the (single) FROM clause of £ that does not appear 
in any SELECT statement embedded in £ (e.g. the topmost FROM clause of ( |5.32| ) is 
the "FROM tabl AS rl, ( ... ) AS r3"). We will often have to distinguish between 
individual occurrences of column references. For example, ( 5.31] ) is a binding context 



for the occurrence of VALID (rl) in the VALID clause, because that occurrence is part 
of ( |5.3lD , rl is defined by the topmost FROM clause of ( 5.31 ), and the occurrence of 



VALID(rl) is not in the topmost FROM clause of ( 5.31 ). ( 5.31| ), however, is not a binding 



context for the occurrence of VALID (rl) in the embedded SELECT statement of ( 5.31] ), 
because that occurrence is in the topmost FROM clause, and it does not follow the 
definition of rl. 

(5.31) SELECT DISTINCT rl.l, r3.2 
VALID VALID (rl) 

FROM (SELECT DISTINCT SNAPSHOT r2 . 1 , r2.2 
FROM tab2 AS r2 

WHERE VALID (r2) CONTAINS VALID (rl) 
) AS r3, 
tabl AS rl 
WHERE rl.l = 'J.Adams' 



In contrast, ( |5.32 ) is a binding context for the VALID(rl) in the embedded SELECT, 
because the definition of rl precedes that occurrence of VALID (rl). 
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(5.32) SELECT DISTINCT rl.l, r3.2 
VALID VALID (rl) 
FROM tabl AS rl, 

(SELECT DISTINCT SNAPSHOT r2 . 1 , r2.2 
FROM tab2 AS r2 

WHERE VALID (r2) CONTAINS VALID (rl) 
) AS r3 
WHERE rl.l = 'J.Adams' 



In both j5l3lD and ( ^32|) , the overall SELECT statement is not a binding context for 



r2.1, r2.2, and VALID(r2), because r2 is not defined by the topmost FROM clause of 
the overall SELECT statement. The embedded SELECT statement of ( |5.31| ) and ( |5.32j ), 
however, is a binding context for r2.1, r2.2, and VALID (r2). 

Free column reference: A column reference a.i or VALID (a) is a free column ref- 
erence in a Tsql2 expression £, iff: 

• the column reference is part of £, and 

• there is no SELECT statement in £ (possibly being the whole £) that is a binding 
context for the column reference. 



The VALID(rl) in the embedded SELECT statement of ( |5.31| ) is free in ( |5.31| ), because 



there is no binding context for that occurrence in (5.31). In contrast, the VALID(rl) in 



the VALID clause of ( |5.31| ) is not free in Q5.31| ), because ( ^.31| ) is a binding context for 
that occurrence. The VALID (r2) of ( 5.3 1| ) is not free in ( |5,31| ), because the embedded 



SELECT statement is a binding context for VALID (r2). 

A correlation name a has a free column reference in a Tsql2 expression £, iff there is 
a free column reference a.i or VALID (a) in £. For every Tsql2 expression £, FCN(^) 
is the set of all correlation names that have a free column reference in £. For example, 
if £ is ( |5.31| ), FCW(£) = {rl} (the VALID (rl) of the embedded SELECT statement is free 
in fH). 

There must be no free column references in the overall SELECT statements that are 
submitted to the Tsql2 (or Sql-92) interpreter (though there may be free column 
references in their embedded SELECT statements). Hence, it is important to prove that 
there are no free column references in the overall SELECT statements generated by the 
Top to Tsql2 translation. 
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Value expression: In Tsql2 (and Sql-92), value expression refers to expressions 
that normally evaluate to elements of D (universal domain). (The meaning of "nor- 
mally" will be explained in following paragraphs.) For example, ' J . Adams ' , VALID (sal) , 
and INTERSECT (PERIOD '[1993 - 1995]', PERIOD '[1994 - 1996]') are all value ex- 
pressions. 

Assignment to correlation names: An assignment to correlation names is a func- 
tion g db that maps every TSQL2 correlation name to a possible tuple of a snapshot or 
valid-time relation. G db is the set of all assignments to correlation names 

If a is a (particular) correlation name, {v±, V2, ■ ■ ■ ) is a (particular) tuple of a snapshot 
or valid-time relation, and g db G G db , (g db )^ Vl V2 ^ is the same as g db , except that 
it assigns (vi,V2,---) to a. (For every other correlation name, the values of g db and 
(q db )?„, „ \ are identical.) 

eval: For every Tsql2 SELECT statement or value expression £, and every st G 
CHRONS and g db G G db , eval(st,£,g db ) is the relation (if £ is a SELECT statement) 
or the element of D (if £ is a value expression) that is generated when the Tsql2 
interpreter evaluates £ in the following way: 

• st is taken to be the current chronon. 

• Every free column reference of the form a.i is treated as a value expression that 
evaluates to Vi, where Vi is the i-th attribute value in the tuple g db {a). 

• Every free column reference of the form VALID (a) is treated as a value expression 
that evaluates to vt, where vt is the time-stamp of g db (a). 

If £ cannot be evaluated in this way (e.g. £ contains a free column reference of the 
form q.4, and g db {a) = (v±, V2, V3)), eval{st,^, g db ) returns the special value error. (I 
assume that error D.) A value expression £ normally (but not always) evaluates to 
an element of D, because when errors arise eval{st,^,g db ) = error D. If, however, 
eval(st,^,g db ) / error, eval(st,£,g db ) G D. 

Strictly speaking, eval should also have as its argument the database against which £ 
is evaluated. For simplicity, I overlook this detail. Finally, if FCN(^) = (£ contains 
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no free column references), eval(st,£, g ) does not depend on g db . In this case, I write 
simply eval(st, £). 

5.5 Modifications in TOP and additional TOP terminol- 
ogy 

In the formulae generated by the English to Top translation, each Part[a,(3] is con- 
joined with a subformula that is (or contains another subformula) of the form At[(3, (j)], 
Befom[(3,(j>], or After[p,<j>] {a G PARTS, G YNFORMS, (3 G VARS, and the of 
Part is the same as that of At, Before, or After). For example, ( 5.33| ) and ( 5.35| ) are 



mapped to ( |5.34j ) and ( 5.36|) . Also the reading of ( 5.37 ) where Monday is the time 



when the tank was empty (rather than a reference time; section [2.5.5 ) is mapped to 



(5.33) Tank 2 was empty on a Monday. 

(5.34) Part[monday 9 , mon v ] A At[mon v , Past[e v , empty (tank2)]] 

(5.35) On which Monday was tank 2 empty? 

(5.36) ?mon v Part[monday 9 ,mon v ] A At[mon v , Past[e v , empty (tank2)]] 

(5.37) Tank 2 had been empty on a Monday. 

(5.38) Part[monday 9 , mon v ] A Past[el v , Perf[e2 v , At[mon v , empty (tank2)]]] 

In this chapter, I use a slightly different version of Top, where the Part is merged 
with the corresponding At, Before, or After. For example, ( 5.34| ), Q5.36 ), and ( 5.38 ) 
become ( 5.39| ), ( |5.40p , and ( 5.41 ) respectively. 



(5.39) At[monday 9 , mon v , Past[e v , empty (tank2)]] 

(5.40) ?mon v At[monday 9 ,mon v , Past[e v , empty (tank2)]] 

(5.41) Past\eV , Perf[e2 v , At[monday 9 , mon v , empty (tank2)]]] 

The semantics of At[cr,(3,4>], Before[a, (3, eft], and After[a, (3, (j)] follow (/ is f gP arts if 
a G GPARTS, and f cparts if a G CPARTS.) 

• \\At[a,(3,(j)}\\ st ' et ' lt ' 9 = T iff g(p) G /(<x) and ||^||^et,itn 3 («,g = T _ 

• \\Before[a,f3,(j)\\\ st ' et ^ 9 =TiSg(f3) G f(a) and ||0||^et,Jtn[t /irst ,mm P t(]|^r)),g = r> 



CHAPTER 5. FROM TOP TO TSQL2 218 

• \\After[a,p,(p]\\ st ' et ' lt '9 = T iff g(j3) G f(a) and \\^t,et,ltn( m axpt(\\/3\\ 9 Ui* s t],9 = T. 



In the Top version of this chapter, Part\o, 0], At[f3,cp], Before[f3, <p\, and After[(3 , 0] 
(/3 G I£4i?5) are no longer yes/no formulae, Ai[«, 0], Before[n, cp], and After[n,(p] 
(k G CONS), however, are still yes/no formulae. 

The Top version of chapter |3] is more convenient for the English to Top mapping, while 
the version of this chapter simplifies the Top to Tsql2 translation. In the prototype 
Nlitdb, there is a converter between the module that translates from English to Top 
and the Top to Tsql2 translator. The module that translates from English to Top 
maps ( 5.33| ) , ( |5.35[ ), and {|5 . 3 7| } to ( |5.34D , ( |5.36| ), and fl5.38| ) respectively. The converter 



turns (|53^ ), and (Q) into ( p9|) , ( ggg ), and ( |5lT|) , which are then passed 



to the Top to Tsql2 translator. 

The reader is reminded that the Part[a, (3,v or d\ version of Part is not used in the 
translation from English to Top (section |4,3| ). Hence, only the Part[o~, /?] form of 
Part is possible in formulae generated by the English to Top translation. In the Top 
version of this chapter, Part operators of this form are merged with At, Before, or 
After operators. Therefore, no Part operators occur in the formulae that are passed to 
the Top to Tsql2 translator. As with the At, Before, and After of chapter |3] (section 
3.2), in every At[a, (3, cp], Before[a, f3,(p], and After[a, (3, cp], I require (5 not to occur 



within <p. This is needed to prove the correctness of the Top to Tsql2 translation. 

To avoid complications in the Top to Tsql2 translation, I require that in any At\n, <p], 
Before[K,4>], or After[K,(f>] (k G CONS, <p G YNFORMS) that is passed to the Top 
to Tsql2 translator, / cons («;) G PERIODS . (The definitions of section 3.10 are more 
liberal: they allow f CO ns(n) n ot to belong to PERIODS , though if / co „ s (k) PERIODS , 
the denotation of At[n, cp], Before[n, </>], or After\n, (j)] is always F.) In practice, formu- 
lae generated by the English to Top mapping never violate this constraint. 

For every <j) G YNFORMS , r <jP (pronounced "corners cp" ) is the tuple (ti, T2, T3, . . . , r n ) , 
where r%, ... ,r n are all the constants that are used as arguments of predicates in (j>, 
and all the variables that occur in (p, in the same order (from left to right) they appear 
in (p. If a constant occurs more than once as a predicate argument in <p, or if a variable 
occurs more than once in <p, there are multiple r^s in r <jP for that constant or variable. 
If r <p~ l = {t\,T2, T3, . . . , r n ), the length of r <p~ l is n. For example, if: 

cp = Ntense[t v ,woman(p v )] A At[1991, Past[e v ,manager_of(p v , sales)}] 
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basic TOP cons pfuns culms cparts gparts 

expressions 
h' 




gparts expressions eval 



Figure 5.1: Paths from basic Top expressions to the modelled world 
then r (jP = (t v ,p v ,e v ,p v , sales) , and the length of r (jP is 5. 

5.6 Linking the TOP model to the database 



As discussed in section |3.6| , the answer to an English question submitted at st must 
report the denotation ||(/>|| M,S * of the corresponding Top formula <p. \\4>\\ M,st follows 
from the semantics of Top, provided that the model M, which intuitively provides all 
the necessary information about the modelled world, has been defined. In a Nlidb, 
the only source of information about the world is the database.^ Hence, M has to be 
defined in terms of the information in the database. This mainly involves defining / cons , 
f pfuns, f culms, f cparts, and f gparts (which are parts of M) in terms of database concepts. 

Scons, f pfuns, f culms, f cparts, and J \parts show how certain basic Top expressions (con- 
stants, predicates, and partitioning names) relate to the modelled world. These func- 
tions will be defined in terms of the functions h cons , h p f uns , h cu i ms , h cpa rts, and h gpar ts (to 
be discussed in section 5/7), and fr> (section [5.2. 1| ). Roughly speaking, the h functions 



map basic Top expressions to database constructs (attribute values or relations), and 
fo maps the attribute values of these constructs to world objects (figure |Q| ). h cons , 
hpfuns, hcuims, h cpart s, and h gpart s will in turn be defined in terms of the functions h' cons , 
tipfrnsi h 'cuims> K parts , and h' gparts (to be discussed in section [T|), and eval (section 



5.4). The h! functions map basic Top expressions to Tsql2 expressions, and eval 



maps Tsql2 expressions to database constructs. 

After defining the h! functions, one could compute using a reasoning system, 

that would contain rules encoding the semantics of Top, and that would use the path 
basic Top expressions — > Tsql2 expressions — ► database constructs — ► modelled world 



12 



This is not entirely true in the framework of this thesis, as there is also a type-hierarchy of world- 



entities in the Hpsg grammar (section 4.2) 
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resulting 




query execution relation 

Figure 5.2: Paths from TOP formulae to their denotations 



(figure |OD to compute any necessary values of f cons , f p f U ns, f culms, fcparts, and f gpar ts- 
That is, only basic Top expressions would be translated into Tsql2, and the Dbms 
would be used only to evaluate the Tsql2 translations of these expressions. The rest 
of the processing to compute ||<^|| M ' S * would be carried out by the reasoning system. 

This thesis adopts an alternative approach that exploits the capabilities of the Dbms 
to a larger extent, and that requires no reasoning system. Based on the h' functions 
(that map only basic Top expressions to Tsql2 expressions), a method to translate 
any Top formula into Tsql2 will be developed. Each Top formula <j> will be mapped 
to a single Tsql2 query (figure |5.2| ). This will be executed by the Dbms, generating 
a relation that represents (via an interpretation function) ||(/>|| M,st . It will be proven 
formally that this approach generates indeed ||</>|| M,S * (i.e. that paths 1 and 2 of figure 



5.2 lead to the same result). 

There is one further complication: the values of f cons , f p f un s, f culms, fcparts, and fgparts 
will ultimately be obtained by evaluating Tsql2 expressions returned by h' cons , h' p f uns , 
Kuims' Kparts, and h' gparts . A Tsql2 expression, however, may generate different results 
when evaluated at different times (e.g. a SELECT statement may return different results 
after a database relation on which the statement operates has been updated). This 
causes the values of f cons , f p j uns , f culms, fcparts, and fgparts to become sensitive to the 
time where the TSQL2 expressions of the b! functions are evaluated. We want this 
time to be st, so that the Tsql2 expressions of the h! functions will operate on the 
information that is in the database when the question is submitted, and so that a 
TSQL2 literal like PERIOD 'today' (section |5.2.4j ) in the expressions of the h! functions 
will be correctly taken to refer to the day that contains st. To accommodate this, f CO ns, 
fpfuns, f culms, fcparts, and f gvar ts must be made sensitive to st: 

fcons becomes a function PTS ^ {CONS ^ OBJS) instead of CONS ^ OBJS. This 
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allows the world objects that are assigned to Top constants via f cons to be different 
at different sis. Similarly, f p f uns is now a function over PTS. For every st £ PTS, 
fpfuns(st) is in turn a function that maps each pair {ir,n), where ir £ PFUNS and 
n 6 {1,2,3,... }, to another function (OBJS) n ^ pow{PERIODS) (cf. the definition 
of fpfuns in section HH). The definitions of f cu ims, fcparts, and f gpar ts are modified accord- 
ingly. Whatever restrictions applied to f cons , f p f un s, fcuims, fcparts, and fg par ts, now apply 
to f consist), fpfuns(st), fcuims(st), fcparts (st) , and f gpa rts(st), for every st £ CHRONS. 
Also, wherever f cons , f p f uns , fcuims, fgparts, fcparts were used in the semantics of Top, 
f consist), fpfuns(st), fcuims(st), and f gpar ts(st) should now be used. The Top model also 
becomes sensitive to st, and is now defined as follows: 

M(st) = ((PTS,-<), OBJS,f cons (st),f pfuns (st),f cu i ms (st),f gparts (st),f C p art s(st)) 

Intuitively, M(st) reflects the history of the world as recorded in the database at st. 
(If the database supports both valid and transaction time, M(st) reflects the "beliefs" 



of the database at st; see section 1.2.4 .) The answer to an English question submitted 



at st must now report the denotation ||0|| M ( S *)> S * of the corresponding Top formula 



5.7 The h functions 



I first discuss h cons , h p f uns , h cu i ms , K parts , and h gparts , the functions that - roughly 
speaking - map basic Top expressions to database constructs. As with f con s, fpfuns, 
fcuims , fcparts , and fgparts , the values of h cons , h p f uns , h cu ims , hcparts , and h gpar t s will 
ultimately be obtained by evaluating Tsql2 expressions at st. The results of these 
evaluations can be different at different sts, and hence the definitions of the h functions 
must be sensitive to st. 



hcons- hcons is a function PTS i ► {CONS i ► D). For every st £ PTS, h cons (st) 
is in turn a function that maps each Top constant to an attribute value that rep- 
resents the same world-entity. For example, h cons (st) could map the Top constant 
sales -department to the string attribute value Sales Department, and the constant 
today to the element of Dp (Dp C D) which denotes the day-period that contains st. 



h p funs : \funs is a function over PTS. For every st £ PTS, h p f uns (st) is in turn a func- 
tion over PFUNS x {1, 2, 3, . . . }, such that for every vr £ PFUNS and n £ {1, 2, 3, . . . }, 
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h p f uns (st)(ir,n) G NVRELp(n) (section p.2.3 ). h p f uns (st) is intended to map every 
Top predicate of functor ir and arity n to a relation that shows for which argu- 
ments of the predicate and at which maximal periods the situation represented by 
the predicate is true, according to the "beliefs" of the database at st. For example, 
if circling(ba737) represents the situation where BA737 is circling, and according to 
the "beliefs" of the database at st, p is a maximal period where BA737 was/is/will be 
circling, h p f uns (st)(circling, 1) must contain a tuple (v;vt), where /d(v) = f CO ns(ba737) 
(v denotes the flight BA737), and /r>(ut) = p. Similarly, if h p f uns (st)(circling, 1) con- 
tains a tuple (v; Vt), where /d(v) = fcons(ba737) and fo(vt) = P, P is a maximal period 
where BA737 was/is/will be circling, according to the "beliefs" of the database at st. 



hcuims^ h cu i ms is a function over PTS. For every st G PTS, h, cu \ ms (st) is in turn a func- 
tion over PFUNS x {1, 2, 3, . . . }, such that for every vr G PFUNS and n G {1, 2, 3, . . . }, 
h C uims(st)(ir,n) G SREL(n). Intuitively, h cu i ms plays the same role as f cu i ms (section 



3.4). In practice, h cu i ms is consulted only for predicates that describe situations with 
inherent climaxes. h cu i ms (st) maps each Top predicate of functor tt and arity n to 
a relation that shows for which predicate arguments the situation of the predicate 
reaches its climax at the latest time-point where the situation is ongoing, according 
to the "beliefs" of the database at st. If, for example, inspecting(j_adams, ba737) 
represents the situation where J.Adams is inspecting BA737, h p f uns (st) (inspecting, 2) 
is a relation in NVRELp(2) and h cu i ms (st) (inspecting, 2) a relation in SREL(2). If, 
according to the "beliefs" of the database at st, the maximal periods where J.Adams 
was/is/will be inspecting BA737 are pi,P2, ■ ■ ■ ,Pj, h p f uns (st) (inspecting, 2) contains 
the tuples {vi,v 2 ;v}), (v 1 ,v 2 ;v?), ... , {vi,v 2 ;vl), where /d(vi) = f con s(j-adams), 
Jd(v2) = fcons(ba737), and /d^ 1 ) = Pi, /d(^ 2 ) = p 2 , ■ ■ ■ , /d(^) = Pj- Let us assume 
that p is the latest maximal period among p\,...,pj. h cu i ms (st) (inspecting, 2) contains 
{vi,v 2 ) iff according to the "beliefs" of the database at st, the inspection of BA737 by 
J.Adams reaches its completion at the end of p. 



hgparts: hg par t s is a function over PTS. For every st G PTS, h gpar ts(st) is in turn a 
function that maps every element of GPARTS to an r G SREL(\), such that the set 
S = {$d(v) I (v) G r} is a gappy partitioning. h gpar t s (st) is intended to map each Top 
gappy partitioning name a g to a one-attribute snapshot relation r, whose attribute 
values represent the periods of the gappy partitioning S that is assigned to a g . For 
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example, h gpar ts(st) could map monday 9 to a one-attribute snapshot relation whose 
attribute values denote all the Monday-periods. 

As with the other h functions, the values of h gpar t s will ultimately be obtained by 
evaluating Tsql2 expressions at st (see section [5]9] below). The results of these evalu- 
ations can in principle be different at different sts, and this is why h gpar t s is defined to 
be sensitive to st. In practice, however, the Tsql2 expressions that are evaluated to 
obtain the values of h gpar t s will be insensitive to their evaluation time, and hence the 
values of h gparts will not depend on st. Similar comments apply to h cparts below. 



hcparts: hearts is a function over PTS. For every st £ PTS, h cpar t s (st) is in turn 
a function that maps every element of CPARTS to an r £ SREL(l), such that the 
set S = {fo{v) | (v) £ r} is a complete partitioning. h cpar t s (st) is intended to map 
each Top complete partitioning name a c to a one-attribute snapshot relation r, whose 
attribute values represent the periods of the complete partitioning S that is assigned 
to a c . For example, h cpar t s (st) could map day c to a one-attribute snapshot relation 
whose attribute values denote all the day-periods. 



5.8 The TOP model in terms of database concepts 



The Top model (see section 3^ and the revisions of section 5J5) can now be defined 
in terms of database concepts as follows. 



Point structure: (PTS, -<) d = (CHRONS, ^ chr ° ns ) 

As mentioned in section ggg , CHRONS / 0, and {CHRONS, ^ chrons ) has the proper- 
ties of transitivity, irreflexivity, linearity, left and right boundedness, and discreteness. 
Hence, (CHRONS ,< chrons ) qualifies as a point structure for Top (section |3.3| ). 

Since (PTS, <) = (CHRONS, ^rons^ PERIODS (pts,~<) = PERIODS \ghrons,*<*™~)> 
and INSTANTS (pts ,~() = INSTANTS mHRONS,< chrons )- I wr ite simply PERIODS and 
INSTANTS to refer to these sets. 



OBJS: 



OBIS = 7 OBJS 1 



Since PERIODS C OBJS db (section ggg ) and OBJS = OBJS db , PERIODS C OBJS, 



as required by section 3.4 
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def 

f C on S : For every st G PTS and k G CONS, I define /con* («*)(«) = f D(Kons(st)(n)) . 
Since h cons (st) is a function CONS \—* D, and /d is a function D OBJS db , and 



OBJS = OBJS , f consist) is a function CONS \— > OBJS, as required by section ^4 
and the revisions of section 



f p f uns : According to section ^4 and the revisions of section for every st G PTS 1 , 
/p/ims (st) must be a function: 

PFUNS x {1, 2, 3, . . . } ^ ((OBJS) 11 pow(PERIODS)) 

That is, for every 7r G PFUNS, every n G {1, 2,3,... }, and every oj., . . . , o n G OBJS, 
f P funs(st)(ir,n)(o 1 , ...,o n ) must be a set of periods. I define f p funs(st)(-K,n)(oi, . . . ,o n ) 
as follows: 

f P funs(st)(7T,n)(oi, . . . ,o n ) d = {f D (v t ) | (j^, 1 (oi ),..., Z^ 1 (o n ) ; u t ) G h pfuns (st)(7r,n)} 

The restrictions of section [5^7] guarantee that h p f uris (st)(-K ,n) G NVRELp(n), which 
implies that for every (/^(oi), . . . , /^(on); v t ) G h pfuns (st)(ir, n), f D (vt) G PERIODS. 
Hence, f p f U ns(st)(7r, n)(o\,. . . , o n ) is a set of periods as wanted. 



As discussed in section 5.7, if vr(ri, . . . ,r n ) represents some situation, and T\, . . . ,r n 
denote oi,...,o n , then h pfuns (st)(ir,n) contains (/^(oi), . . . , /^(on); ft) iff /o(«t) is 
a maximal period where the situation of vr(ri, . . . , r n ) holds. f p f U ns(st)(ir, n)(o\, . . . , o n ) 
is supposed to be the set of the maximal periods where the situation of vr(ri, . . . ,r n ) 
holds. The definition of f p f uns above achieves this. 



According to section 3.4 and the revisions of section 5.£, it must also be the case that: 



ifpi,p 2 £ fpfuns (st) (ir, n)(oi, ... ,o n ) andy>iUp 2 G PERIODS, then pi = p 2 

fpfuns, as defined above, has this property. The proof follows. Let us assume that p\ 
and p2 are as above, but p\ ^ P2- As discussed above, the assumption that pi,P2 G 
fpfuns (st)(7r,n)(oi, . . . ,o n ) implies that p\,P2 G PERIODS . 

Let v] = f^(pi) and v 2 t = f D 1 (p 2 ) (i.e. pi = fo(v}) and p 2 = /z?K 2 )). Since, 
Pi 7^ P2 and fp 1 is 1-1 (section [5.2. l|) , (pi) ^ f^(p2), i-e. v\ / v\. The definition 
of fpfuns (st)(ir,n)(oi, . . . ,o n ), the assumptions that pi,p 2 G fpfuns (st)(ir,n)(oi, . ..,o n ) 
and that p\L)p2 G PERIODS , and the fact that p\ = fo(vt) an d = /d(^|) imply 
that h p f uns (st)(ir, n) contains the value-equivalent tuples (/^ 1 (oi), . . . , /^ 1 (o n ); u^) and 
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(f D \oi), /dHO;*!), where /dK) u fD{v 2 t ) G PERIODS. This conclusion, the 
fact that h p f uns (st)(iT,n) G NVRELp(n) (see previous paragraphs), and the definition 
of NVRELp{n) (section |5.2.3| ) imply that v\ = v\, which is against the hypothesis. 
Hence, it cannot be the case that p\ ^ P2, i-e. p± = P2- Q-E.D. 



fcuims : According to section I^J and the revisions of section |5.6| , for every st G PTS, 
fculms( s t) must be a function: 

PFUNS x {1,2,3,...} i-» {{OBJS) n {T, F}) 

For every vr G PFUNS, n G {1, 2, 3, . . . }, and o\, . . . ,o n G OBJS, I define: 

def (T, if (/^(oi), . . . , fn l {o n )) G h culms (st)(ir, n) 



fculms{K,n)(oi, ... ,o n ) 



F, otherwise 



The restrictions of section 5.7, guarantee that h cu i ms {st){ir,n) G SREL(n). As dis- 



cussed in section 5.7, if a predicate 7t(tx, . . . , T n ) represents some situation with an 
inherent climax, and t\, ... , r n denote o\, ... , o n , then h cu i ms (st)(ir,n) contains 
(/^(oi), . . . , fj} l {o n )) iff the situation reaches its climax at the end of the latest maxi- 
mal period where the situation is ongoing. f C uims{st){i: , n)(oi, . . . , o n ) is supposed to be 
T iff the situation of vr(ri, . . . , r n ) reaches its climax at the end of the latest maximal 
period where it is ongoing. The definition of f cu ims above achieves this. 



f gP art s : For every st G PTS and a g G GPARTS, f gP arts(st)(a : 



= ifnir) 



(v) G 



h, 



gparts 



{st)(a g )}. The restrictions on h gparts of section 5/7 guarantee that f g parts(st)(a g ) 



is always a gappy partitioning, as required by section 3.4 and the revisions of section 



5.6. 



def 

fcpart s : For every st G PTS and a c G CPARTS, f cp arts(st)(a c ) = {/d(v) \ (v) G 



h 



cparts 



(st)(a c )}. The restrictions on h cparts of section [T7] guarantee that fcparts (st) (a c ) 



is always a complete partitioning, as required by section 3.4 and the revisions of section 



5.6. 



5.9 The h! functions 



I now discuss h' cons , h' pfuns , h' culms , h' gparts , and h' cparts , the functions that map basic 
Top expressions (constants, predicates, etc.) to Tsql2 expressions. I assume that 
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hcons : h' cons maps every Top constant k to a Tsql2 value expression £, such that 
FCN(g) = 0, and for every st G CHRONS, eval(st,£) G D. (The latter guarantees 
that eval(st,£) ^ error.) £ is intended to represent the same world object as k. For 
example, h' cons could map the Top constant sales -department to the Tsql2 value 
expression 'Sales Department', and the Top constant yesterday to PERIOD 'today' 
- INTERVAL ' 1 ' DAY. In practice, the values of h' cons need to be defined only for Top 
constants that are used in the particular application domain. The values of h' cons for 
other constants are not used, and can be chosen arbitrarily. Similar comments apply 

t° hpfunsi ^culmsi ^gpartsi an0 - departs- 

h cons is defined in terms of h' cons . For every st G CHRONS and k G CONS: 

h cons (st)(K) d = eval(st, h' cons (K)) 

The restrictions above guarantee that eval(st, h' cons (K)) G D. Hence, h cons (st) is a 
function CONS \— ► D, as required by section |5.7| . 



hpfuns : ^'pfuns ^ s a function that maps every tt G PFUNS and n G {1, 2, 3, ... } to a 
TSQL2 SELECT statement E, such that FC7V(E) = 0, and for every st G CHRONS, 
eval(st,T>) G NVREL P (n). h'fa^n^) is intended to be a Tsql2 SELECT statement 
that generates the relation to which h p f uns (st) maps it and n (the relation that shows for 
which arguments and at which maximal periods the situation described by tt(t%, . . . , r n ) 
is true). 

hpfuns is defined in terms of h' p j uns . For every st G CHRONS, ir G PFUNS, and 
ne {1,2,3,...}: 

hpfuns (st)(w,n) d = eval(st,h' pfuns (-K,n)) 

The restrictions on h' p f uns above guarantee that ev al(st, h' p j uns (it, n)) G NVRELp(n). 
Hence, h p f uns (st)(ir,n) G NVRELp(n), as required by section 5.7 . 

Let us assume, for example, that manager(T) means that r is a manager, and that 
manager _of is the relation of NVRELp(2) in ( 5.42j ) that shows the maximal periods 
where somebody is the manager of a department. (To save space, I often omit the 
names of the explicit attributes. These are not needed, since explicit attributes are 
referred to by number.) 
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J.Adams 


sales 


[1/5/93, 31/12/94] 


J.Adams 


personnel 


[1/1/95, 31/3/95] 


J.Adams 


research 


[5/9/95, 31/12/95] 


T.Smith 


sales 


[1/1/95, 7/5/95] 



h' p j uns (manager,!) could be denned to be ( 5.43Q , which generates (|5.44| ) (( |5.44 ) is 



an element of NVRELp(l), as required by the definition of h' p j uns ). The embedded 



SELECT statement of ( 5.43 ) discards the second explicit attribute of manager _o f . The 
(PERIOD) coalesces tuples that correspond to the same employees (e.g. the three periods 
for J.Adams), generating one tuple for each maximal period. 



(5.43) 



SELECT DISTINCT mgr2 . 1 

VALID VALID (mgr2) 

FROM (SELECT DISTINCT mgrl.l 
VALID VALID (mgrl) 
FROM manager_of AS mgrl 
) (PERIOD) AS mgr2 



(5.44) 



J.Adams 
J.Adams 
T.Smith 



[1/5/93, 31/3/95] 
[5/9/95, 31/12/95] 
[1/1/95, 7/5/95] 



hculms : Kulms ^ s a function that maps every tt G PFUNS and n G {1,2,3, ... } to 
a TSQL2 SELECT statement E, such that FCN(E) = 0, and for every st G CHRONS, 
eval(st,Y>) G SREL(n). h' culrns (ir,n) is intended to be a Tsql2 SELECT statement that 
generates the relation to which h cu i ms (st) maps tt and n (the relation that shows for 
which arguments of 7t(ti, . . . ,r n ) the situation of the predicate reaches its climax at 
the end of the latest maximal period where it is ongoing). 



hcuims is defined in terms of hi 
nG {1,2,3,...}: 

hcuims (st) (ir, n) d = eval(st, ti cu[ms (ir,n)) 



For every st G CHRONS, tt G PFUNS, and 

def 



The restrictions on h' culms above guarantee that eval(st,h' culms (i:,n)) G SREL(n). 
Hence, for every tt G PFUNS and n G {1,2,3,...}, h cu i ms (st)(i:,n) G SREL(n), as 



required by section 5.7. 



In the airport application, for example, inspecting(ri,T2,T^) means that an occurrence 
r\ of an inspection of T3 by T2 is ongoing, inspections is a relation of the following 
form: 
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inspections 


code 


inspector 


inspected 


status 




il58 


J.Adams 


UKim 


complete 


[9:00am 1/5/95 - 9:45am 1/5/95] 










U [10:10am 1/5/95 - 10:25am 1/5/95] 


il60 


J.Adams 


UK16Q 


incomplete 


[11:00pm 2/7/95 - 1:00am 3/7/95] 










U [6:00am 3/7/95 - 6:20am 3/7/95] 


i205 


T.Smith 


BA737 


complete 


[8:00am 16/11/95 - 8:20am 16/11/95] 


i2U 


T.Smith 


BATil 


incomplete 


[8:10am 14/2/96 - now] 



The first tuple above shows that J.Adams started to inspect UK160 at 9:00am on 
1/5/95, and continued the inspection up to 9:45am. He resumed the inspection at 
10:10am, and completed the inspection at 10:25am on the same day. status shows 
whether or not the inspection reaches its completion at the last time-point of the 
time-stamp. In the first tuple, its value is complete, signaling that the inspection was 
completed at 10:25am on 1/5/95. The inspection of the second tuple was ongoing from 
11:00pm on 2/7/95 to 1:00am on 3/7/95, and from 6:00am to 6:20am on 3/7/95. It did 
not reach its completion at 6:20am on 3/7/95 (perhaps it was aborted for ever). The 
inspection of the last tuple started at 8:10am on 14/2/96 and is still ongoing. Each 
inspection is assigned a unique inspection code, stored as the value of the code attribute. 
The inspection codes are useful to distinguish, for example, J. Adams' inspection of 
UK160 on 1/5/95 from that on 2-3/7/95 (section 3.16j ). h' p f uns (inspecting,3) and 
^'culms (inspecting, 3) are defined to be ( |5.45| ) and ( |5.46| ) respectively. 



(5.45) SELECT DISTINCT insp.l, insp.2, insp.3 
VALID VALID(insp) 

FROM inspections (PERIOD) AS insp 

(5.46) SELECT DISTINCT SNAPSHOT inspcmpl . 1 , inspcmpl.2, inspcmpl.3 
FROM inspections AS inspcmpl 

WHERE inspcmpl. 4 = 'complete' 



This causes h p f uns (st) (inspecting, 2) and h cu [ ms (st) (inspecting, 2) to be ( 5.47j ) and 



(5.48j) respectively. 



(5.47) 



(5.48) 



zl58 


J.Adams 


UKim 


[9:00am 1/5/95, 9:45am 1/5/95] 


zl58 


J.Adams 


UKim 


[10:10am 1/5/95, 10:25am 1/5/95] 


«160 


J.Adams 


UKim 


[11:00pm 2/7/95, 1:00am 3/7/95] 


«160 


J.Adams 


UKim 


[6:00am 3/7/95, 6:20am 3/7/95] 


i205 


T.Smith 


BA737 


[8:00am 16/11/95, 8:20am 16/11/95] 


£214 


T.Smith 


BATi7 


[8:10am 14/2/96, now] 



il58 


J.Adams 


UKIQQ 


i205 


T.Smith 


BA737 
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hgparts : Mparts * s a function that maps every Top gappy partitioning name o g to a 
Tsql2 SELECT statement E, such that FCN(L) = 0, and for every st £ CHRONS, it is 
true that eval(st, S) € SREL(1) and {fo(v) \ (v) £ eval(st, £)} is a gappy partitioning. 
h'gparts( a g) 1S intended to generate the relation to which h gpar t s (st) maps <r s (the relation 
that represents the members of the gappy partitioning). Assuming, for example, that 



the gregorian calendric relation of section 5.3.3 is available, h' gparts (sunday 9 ) could be 
flUg ) of page|210. 



hgparts is defined in terms of h' gparts . For every st £ CHRONS and o g £ GPARTS: 

hgparts(st)(a g ) = eval(st, h' gparts {a g )) 
The restrictions on h' gparts and the definition of h gpar t s (st) above satisfy the requirements 



on h g p arts of section . 



hcparts- I assume that for each complete partitioning used in the Top formulae, there 
is a corresponding Tsql2 granularity (section |5.2.2| ), h' cparts is a function that maps 
each Top complete partitioning name to an ordered pair (7, S), where 7 is the name 
of the corresponding TSQL2 granularity, and E is a SELECT statement that returns a 
relation representing the periods of the partitioning. More precisely, it must be the 
case that FCN(E) = 0, and for every st & CHRONS, eval(st,Y,) G SREL(l) and 
{/d(v) I (v) £ eval(st,T,)} is a complete partitioning. For example, if the gregorian 
relation of section |5.3.3| is available, h' cparts could map day c to (DAY, S), where S is 
( 5,49j ). ( 5.49j ) returns a one- attribute snapshot relation whose attribute values denote 



all the day-periods. 

(5.49) SELECT DISTINCT SNAPSHOT VALID (greg2) 
FROM (SELECT DISTINCT gregl.4 
VALID VALID (gregl) 
FROM gregorian AS gregl 
) (PERIOD) AS greg2 

hcparts is defined in terms of h' cparts . For every st £ CHRONS and a c £ CPARTS, if 
Kparts (<rc) = <7,S), then: 

hcparts (st) (a c ) = eval(st,T,) 
The restrictions on h' rts and the definition of h cpar t s (st) above satisfy the require- 



ments on h cpar ts of section 5/7. The 7 is used in the translation rule for For[a c , v q t y , < 
(appendix |A|). 
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5.10 Formulation of the translation problem 



Let us now specify formally what we want the Top to Tsql2 translation to achieve. I 
first define interp (interpretation of a resulting relation). For every eft G FORMS and 
every relation r: 



(5.50) interp(r, 4>) d = 



T, if € YNFORMS and r / 
F, if dp G YNFORMS and r = 

{(/d(^i), • • .,f D {v n )} | • ,«n) € r}, 

if 6 G WHFORMS 



Intuitively, if </> was translated to a SELECT statement that generated r, interp(r, <f>) 
shows how to interpret r. If <f) G YNFORMS (yes/no English question) and r / 8, 
the answer should be affirmative. If G YNFORMS and r = 0, the answer should be 
negative. Otherwise, if </> G WHFORMS (the English question contains interrogatives, 
e.g. "Who . . . ?", "When . . . ?"), the answer should report all the tuples of world 
objects (/d(«i), • • • , fD(v n )) represented by tuples (vi,...,v n ) G r. 

A translation function tr is needed, that maps every cf) G FORMS to a Tsql2 SELECT 
statement tr(cp), such that for every st G PTS, (5.51) and (5.52) hold. 

(5.51) FCN(tr((f>)) = 

(5.52) interp{eval(st, tr((f>)), 0) = \\4>\\ M ^' st 



M(st) must be as in section 5.6. As discussed in section 3.£, each (reading of an) 
English question is mapped to a Top formula (p. The answer must report ||<^|| M ( st )' ,s * . 
If tr satisfies (5.52), ||^>|| M ( S *) ,S * can be computed as interp(eval(st, tr(<j))), <p), by letting 
the Dbms execute tr(<p) (i.e. compute eval(st,tr (&)))■ 

tr will be defined in terms of an auxiliary function trans, trans is a function of two 
arguments: 

trans(4>, A) = S 

where <fi G FORMS, A is a Tsql2 value expression, and E a Tsql2 SELECT statement. 



A set of "translation rules" (to be discussed in section 5.11 ) specifies the E-values of 
trans. In practice, A always represents a period. Intuitively, A corresponds to Top's 
It. When trans is first invoked (by calling tr, discussed below) to translate a formula 
(f), A is set to PERIOD (TIMESTAMP 'beginning' , TIMESTAMP 'forever') to reflect the fact 
that Top's It is initially set to PTS (see the definition of ||</>|| M ' si in section |3.6| ), trans 
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may call itself recursively to translate subformulae of (j> (this will become clearer in 
following sections). When calling trans recursively, A may represent a period that 
does not cover the whole time-axis, to reflect the fact that already encountered Top 
operators may have narrowed It. 

I define tr as follows: 

def 

(5.53) tr(4>) = trans(4>, Xinit) 

def 

where \ ini t = PERIOD (TIMESTAMP 'beginning', TIMESTAMP 'forever'). Obviously, 
Xinit contains no correlation names, and hence FCN (\i n u) = 0- This implies that 
eval(st, Xinit, g dh ) does not depend on g db . Xi n u evaluates to the element of Dp that 
represents the period that covers the whole time-axis, i.e. for every st E PTS, it is true 



that eval(st, Xi n it) G Dp and fD(eval(st, Xinit)) = PTS. Therefore, lemma 5.1 holds 



Lemma 5.1 FCN(Xi n it) = 0, and for every st S PTS, eval(st, Xi n u) E Dp and 
f D (eval(st, Xmu)) = PTS. 



Using (|5.53|) , (5.51) and (5.52) become (5.54) and (5.55) respectively. The translation 
rules (that specify the values of trans for each <f> and A) must be defined so that for 
every <j) G FORMS and st <G PTS, (5.54) and (5.55) hold. 

(5.54) FCN (transit Xim t )) = 

(5.55) interp(eval(st, trans(<f>, X imt )), <p) = ||</>|| M(st) ' s * 



Appendix [A] proves that theorems 5.1 and [T^ hold for the translation rules of this 
thesis. 

Theorem 5.1 If € WHFORMS, st G PTS, trans(<j>, X init ) = E, and the total 
number of interrogative and interrogative-maximal quantifiers in <ft is n, then: 

1. FCiV(S) = 

2. eval(st,H) G SREL(n) 

3. {(f D ( Vl ),...,f D (v n )) | (v 1 ,...,v n )eeval(st,X)} = \\<t>\\ M W> 8t 



That is, the translation S of (ft contains no free column references, and it evaluates to 
a snapshot relation of n attributes, whose tuples represent ||(/>|| M ( S *)' ,S *. 
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Theorem 5.2 If G YNFORMS, st G PTS, A is a Tsql2 expression, g db G 
eval(st, A, G .Dp, r </> n = (ri, . . . , r n ), and £ = trans(4>, A), then: 

1. FCN(E) C FCN(X) 

2. eval{st,Y 1 ,g db ) G VREL P {n) 

3. (vi,..., v n ; t>t) G eval(st, £, <7 d6 ) iff for some g G G: 
|| ri ||M(^), g = ? ll^ir^^ = / D (« n ), and 

||^||M(st),stJ D MjD(emZ(stA/ ,, )),<? _ j> 



ri, T2, • • • , T n are all the constants in predicate argument positions and all the variables 
in 4> (section |5.5| ) . Clause 3 intuitively means that the tuples of eval(st, E, g db ) represent 
all the possible combinations of values of T\,...,T n and event times et, such that 
\M(st),st,et,lt,g _ T) where M ig the e i ement Q f PERIODS* represented by A. 



I now prove that theorems 5.1 and 5.2 imply that (5.54) and (5.55) hold for every 



st G PTS and 4> G FORMS, i.e. that trans has the desired properties. 

Proof of (5.54): Let st G PTS and <p G FORMS. We need to show that (5.54) holds. 
Since FORMS = WHFORMS U YNFORMS, the hypothesis that 4> G FORMS implies 
that </> G WHFORMS or G YNFORMS. In both cases (5.54) holds: 



If G WHFORMS, then by theorem gj, FCN(trans((j), Xmit)) = 0, i.e. (5.54) 
holds. 



If G YNFORMS , then by theorem |5.2| and lemma f^lj, the following holds, 
which implies that (5.54) also holds. 

FCN{trans{4>, \ im t)) Q FCN{\ imt ) = 



Proof of (5.55): Let st G PTS and cj) G FORMS. Again, it will either be the case 
that <p G WHFORMS or <p G YNFORMS. 



If ^ G WHFORMS, then by theorem 5.1 the following is true: 



{(/flW feW) I (vi,...,v n ) £eval(st, trans^, \ imt ))} = \m M ^ st 



The definition of interp, the hypothesis that </> G WHFORMS, and the equation above 
imply (5.55). 
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It remains to prove (5.55) for <fr £ YNFORMS. Let r <P = (n, . . . , r n ). By lemma [53], 
for every g dh £ G"*, eval(st, X init , g db ) = eval(st,X in u) G Dp and f D (eval(st, X in u)) = 
PTS. Also, (5.54) (proven above) implies that eval(st,trans((f), Xinit), g db ) does not 
depend on g db . Then, from theorem |5,2| we get ( |5.56D and (5.57). 



(5.56) eval(st,trans(cfr, Xi n it)) £ VRELp(n) 

(5.57) (vi, . . . , v n ; vt) £ eval(st, trans((f>, Xinit)) iff for some g £ G: 



\M(st),g = fj) ( Vl)i ... ) \\ Tn \\ M ^ = /D ( Un ) > and 
M(st),st,f D (v t ),PTS,g _ r 



The hypothesis that ^ G YNFORMS and the definition of interp imply that the left- 
hand side of (5.55) has the following values: 

{T, if eval(st, trans((j), Xmit)) ^ 
F, if eval(st, trans((j), Xinit)) = 

The hypothesis that (f> G YNFORMS and the definition of ||(/)|| M ( s *)> st (section |j| 
imply that the right-hand side of (5.55) has the following values: 

j T, if for some g G G and et G PERIODS, \\(f)\\M{st), 8 t,et,PTS,g _ r 
[ -F, otherwise 

Hence, to prove (5.55) it is enough to prove ( |5.58j ). 

(5.58) eval(st,trans(4>, Xinit)) 7^ iff 

for some g G G and et G PERIODS, \\<j,\\M(.*Q,*,et,PTSj = T 



I first prove the forward direction of ( 5.58| ). If it is true that eval(st, trans ((f), Xi n u)) 7^ 



by ( |5,56D eval(st, trans(4>, Xinit)) contains at least a tuple of the form (vi, . . . , u n ; i^) 
i.e. ( [5.59 ) is true. 



(5.59) . . . , v n ; v t ) G eval(st, trans((f>, X in it)) 

( 5.59Q and (|5.57 ) imply that for some g £ G, ( |5.60| ) holds. 



(5.60) m M(st),stJ D (v t ),PTS,g = T 

( 5.56j ) and ( |5.59D imply that vt is the time-stamp of a tuple in a relation of VRELp, 



which implies that fi)(vt) £ PERIODS . Let et = fi)(vt). Then, (5.60) becomes (|5.61), 
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where g G G and et = foiyt) G PERIODS . The forward direction of ( p. 58 ) has been 
proven. 

(5.61) || ||M( S t), S t,et,PTS >g = T 

I now prove the backwards direction of ( |5.58j ), I assume that g G G, et G PERIODS , 
and \\(j)\\M{st), s t,et,PTS,g = T Let Vf = /^(et), which implies that et = f D (v t ). Then 
(|5~62?) holds. 



(5.62) ||^||M(,t) lS t,/ D ( Vt ),Pra, g = T 



Let «! = f D \\\n\\ M ^' 9 ), ... ,v n = fDHlknf 1 ^ 119 )- This implies that ( ^63|) also 
holds. 

(5.63) llnir^' 9 = frfa), . . . , ||r n || M ( s *)^ = f D (v n ) 



( 5.63j ), ( |5.62| ), the hypothesis that g G G, and ( [5.57 ) imply ( 5.64| ), which in turn implies 



that eval(st, trans ((f), Xmit)) 7^ 0- The backwards direction of (|5.58 ) has been proven. 



(5.64) (vx, . . . , v n ; v t ) G eval(st, trans(<j), \ in i t )) 



This concludes the proof of (5.55). I have proven that trans satisfies (5.54) and (5.55) 
for every eft G FORMS and st G PTS, i.e. that trans has all the desired properties. 



5.11 The translation rules 



The values (SELECT statements) of trans are specified by a set of "translation rules". 
These rules are of two kinds: (a) base (non-recursive) rules that specify trans(4>, A) 
when cj) is an atomic formula or a formula of the form Culm[ir(Ti, . . . ,r n )]; and (b) 
recursive rules that specify trans((p, A) in all other cases, by recursively calling other 
translation rules to translate subformulae of <j). In this section, I attempt to convey the 
intuitions behind the design of the translation rules, and to illustrate the functionality 
of some representative rules. 

In the case of a yes/no formula (ft, the aim is for the resulting SELECT statement to 
return a relation of VREIp(n) that shows all the combinations of event-times et and 
values of t\, ... ,r n ({t%, . . . ,r n ) = r <jP) for which <f> is satisfied. More precisely, the 
tuples of the relation must represent all the combinations of event times et and world 
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objects assigned (by f CO ns(st) and some variable assignment g) to ri, . . . , r n , for which 
\\<j,\\M(st),st,et,lt,B _ y ; where tt is the e i ement Q f PERIODS* represented by A. In each 

tuple (vi, . . . ,v n ',vt) , vt represents et, while vi,...,v n represent the world objects of 
ti, . . . , T n . For example, the rule for predicates is as follows: 

Translation rule for predicates: 

trans(ir(ri, . . . , r n ), A) d = 

(SELECT DISTINCT a.l, a.2, ... , a.n 
VALID VALID (a) 

FROM (/^ /u „ s (7T,n))(SUBPERI0D) AS a 
WHERE . . . 
AND . . . 

AND . . . 

AND A CONTAINS VALID (a)) 
where the "... "s in the WHERE clause stand for all the strings in Si U S2, and: 

S 1 = = h' cons {Tif I i £ {1,2,3,... ,n} and n £ CONS} 

S2 = { "a.i = a.j" I i,j £ {1,2,3,... ,n}, i < j, n = Tj, and Ti,Tj £ VARS} 



I assume that whenever the translation rule is invoked, a new correlation name a is 
used, that is obtained by calling a generator of correlation names. Whenever called, 
the generator returns a new correlation name that has never been generated before. I 
assume that the correlation names of the generator are of some distinctive form (e.g. 
tl, t2, t3, ... ), and that the correlation names in the SELECT statements returned by 
hpfunsi Kuimsi ^'cpartsi an d h' gparts are not of this distinctive form. I also assume that 
some mechanism is in place to ensure that no correlation name of the distinctive form of 
the generator can be used before it has been generated. The use of the generator means 
that trans is strictly speaking not a pure function, since the same ir and ri, . . . , r n lead 
to slightly different SELECT statements whenever trans (tt(ti, . . . , r n ), A) is computed: 
each time the resulting statement contains a different a (similar comments apply to 
other translation rules). There are ways to make trans a pure function, but these 
complicate the translation rules and the proof of their correctness, without offering 
any practical advantage. 

Let us consider, for example, the predicate inspecting (il58, j -adams , ukl60) . Accord- 
ing to section [O], \\inspecting(H58, j-adams, ukl60)\\ M( - st ^' st ' et ' lt ' 9 =T iff et Qlt and 
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P G f P funs(st)(inspecting,3)(\\il58\\ M ^ 9 , \\j_adams\\ M ^ 9 , \\ukl60\\ M ^ 9 ) 
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Let us assume that h' p j uns (inspecting,3) and h p f uns (st) (inspecting, 3) are ( 5.45| ) and 
( 5.47] ) respectively (p. [228D , that il58, j-adams, and ii/cl60 correspond to the obvious 
attribute values of (|5.47| ), and that A is PERIOD ' [9:00am 1/5/95 - 9:30pm 1/5/95] '. 



It is the period represented by A. By the definition of f p f uns of section 



f P f U ns(st)(inspecting,3)(\\il58\\ M ^ 9 , \\j.adams\\ M ^ 9 , \\ukl60\\ M ^> 9 ) = { Pl ,p 2 } 



where p\ and p 2 are the periods of the first two tuples of ( 5.47 ). The denotation of 
inspecting (il58, j _adams , ukl60) is T for all the ets that are subperiods of p\ or p 2 
and also subperiods of It. 

The translation rule above maps inspecting(H58,j-adams,ukl60) to (|5.65|) , where 



h'p^^inspecting^) is the SELECT statement of ( j5.45| ) (that returns ( |5.47D ), 



(5.65) 



(SELECT DISTINCT tl.l, tl.2, tl.3 
VALID VALID(tl) 

FROM (h' pfuns (inspecting, 3)) (SUBPERIOD) AS tl 
WHERE tl.l = 'il58' 

AND tl.2 = 'J.Adams' 

AND tl.3 = 'UK160' 

AND PERIOD '[9:00am 1/5/95 - 9:30pm 1/5/95]' CONTAINS VALID(tl)) 



( |5.65| ) returns (5.66), where the time-stamps correspond to all the subperiods of p± 



and p2 (pi and p 2 are the periods of the first two time-stamps of (5.47)) that are also 
subperiods of It (the period represented by A). 

(5.66) 



il58 


J.Adams 


UK160 


[9 


00am 1/5/95, 9:30pm 1/5/95] 


il58 


J.Adams 


UK160 


[9 


10am 1/5/95, 9:15pm 1/5/95] 


il58 


J.Adams 


UK160 


[9 


20am 1/5/95, 9:25pm 1/5/95] 



In other words, the time-stamps of ( |5,66| ) represent correctly all the ets where the de- 
notation of inspecting(H58, j -adams , ukl60) is T. In this example, all the predicate 
arguments are constants. Hence, there can be no variation in the values of the argu- 



ments, and the values of the explicit attributes in ( 5.66 ) are the same in all the tuples. 
When some of the predicate arguments are variables, the values of the corresponding 
explicit attributes are not necessarily fixed. 
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The S2 constraints in the WHERE clause of the translation rule are needed when the 
predicate contains the same variable in more than one argument positions. In those 
cases, S2 requires the attributes that correspond to the argument positions where 
the variable appears to have the same values. 52 contains redundant constraints when 
some variable appears in more than two argument positions. For example, in ir(/3, (3, (5) 
(f3 G VARS), S2 requires the tuples (vi, V2, V3; Vt) of the resulting relation to satisfy: 
Vi = t>2, v\ = t>3, and V2 = V3. The third constraint is redundant, because it follows 
from the others. The prototype Nlitdb employs a slightly more complex definition 
of S2 that does not generate the third constraint. Similar comments apply to the rule 
for Culm[ir(Ti, . . . , r n )] below, and the rules for conjunction, At[(f>i, fa], Before[(f>i, (fa], 
and After[(f)i, (fa] (appendix^). 

Translation rule for Culm[7r(ri, . . . ,T n )]: 

def 

trans (Culm[ir(Ti, . . . ,r n )], A) = 

(SELECT DISTINCT a-i.l, ai.2, ... , a\.n 
VALID PERIOD (BEGIN (VALID (ai)) , END (VALID (ai) ) ) 
FROM (h' p f uns (tt, n)) (ELEMENT) AS a lt 

WHERE ai.l = a 2 .l 
AND ai.2 = a 2 .2 

AND a±.n = a^-n 
AND ... 

AND . . . 

AND A CONTAINS PERIOD (BEGIN (VALID (ai) ) , END (VALID (ai) ) ) 
Whenever the rule is used, a\ and 0.2 are two new different correlation names, obtained 
by calling the correlation names generator after A has been supplied. The "... "in 
the WHERE clause stand for all the strings in Si U ^2, where S\ and S2 are as in the 
translation rule for predicates, except that a is now ot\. 

The rule for Culm\n{ri, . . . ,t„)] is similar to that for 7t(ti, . . . ,r n ). The resulting 
SELECT statement returns an element of VRELp{n) that shows the ets and the val- 
ues of the predicate arguments for which the denotation of Culm[iT{j\, . . . ,T n )] is 
T. In the case of Culm[it{T\, . . . ,T n )}, however, the generated relation contains only 
tuples (vi, . . . ,v n ;v t ), for which (v\, . . . ,v n ) appears in h cu i ms (st)(Tr,n) (the relation 
returned by h' culms (ir, n)). That is, the situation of 7t(ti, . . . , r n ) must reach its cli- 
max at the latest time-point where it is ongoing. Also, h p f uns (st)(Tr,n) (the rela- 
tion returned by h' t uns {'K,n)) is coalesced using (ELEMENT). This causes all tuples of 
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hpf uns (st)(ir,n) that refer to the same situation to be merged into one tuple, time- 
stamped by a temporal element that is the union of all the periods where the sit- 
uation is ongoing. Let us refer to this coalesced version of h p fi lns (st)(iT,n) as r. 
cti ranges over the tuples of r, while a<i over the tuples of h cu i ms (st)(ir,n). The 
relation returned by trans(Culm[rr(Ti, . . . , r n )], A) contains all tuples {v%, . . . , v n ; v t ), 
such that (v\, . . . , v n ; v' t ) G r, vt represents the period that starts at the beginning 
of the temporal element of v' t and ends at the end of the temporal element of v' t , 
(v\, . . . ,v n ) £ h cu i ms (st)(i:,n), and vts period (i.e. et) is a subperiod of A's period (i.e. 
It). S\ and S2 play the same role as in the translation rule for predicates. 



Let us assume that h' p ^ uns {inspecting ,3) and h' culms (inspecting,3) are ( |5.45|) and (|5.46 ) 



respectively, that h p f uns (st) (inspecting, 3) and h cu i ms (st) (inspecting, 3) are (|5.47| ) and 
(|5.48j ), and that A = PERIOD ' [1/5/95 - 18/11/95] '. The translation rule above maps 



Culm [inspecting(occr v , person" , flight 11 )] to |[67|). 



(5.67) (SELECT DISTINCT tl.l, tl.2, tl.3 

VALID PERIOD(BEGIN(VALID(tl) ) , END ( VALID (t 1) ) ) 
FROM (h' pfuns (inspecting, 3)) (ELEMENT) AS tl, 

(h' culms (inspecting,3)) AS t2 
WHERE tl.l = t2.1 
AND tl.2 = t2.2 
AND tl.3 = t2.3 

AND PERIOD ' [1/5/95 - 18/11/95] ' CONTAINS 

PERIOD(BEGIN(VALID(tl) ) , END ( VALID (t 1) )) ) 



( 5.67| ) returns ( |5,68f ). There is (correctly) no tuple for inspection iWO: the semantics 



of Culm (section |3.9| ) requires the inspection to reach its completion at the latest time- 
point where it is ongoing; according to ( |5.48j ), this is not the case for il60. There is 
also (correctly) no tuple for i214: the semantics of Culm requires et (the time of the 
inspection) to be a subperiod of It (A's period), but i214 does not occur within It. 
Finally, ( ^.68| ) does not contain tuples for the subperiods of [9:00am 1/5/95, 10:25am 
1/5/95] and [8:00am 16/11/95, 8:20am 16/11/95]. This is in accordance with the 
semantics of Culm, that allows Culm[inspecting(occr v , j_adams,ba737)] to be true 
only at ets that cover entire inspections (from start to completion). 

(5.68) 



il58 
z205 


J.Adams 
T.Smith 


UK160 
BA1Z1 


[9:00am 1/5/95, 10:25am 1/5/95] 
[8:00am 16/11/95, 8:20am 16/11/95] 



All the other translation rules for yes/no formulae are recursive. For example, Past[f3, <j)'\ 
is translated using the following: 
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Translation rule for Past[[3,(ft']: 
trans(Past[f3, <//], A) = 

(SELECT DISTINCT VALID (a) , a.l, a. 2, ... , a.n 
VALID VALID (a) 
FROM transit , A') AS a) 

A' is the expression INTERSECT (A, PERIOD (TIMESTAMP 'beginning', TIMESTAMP 'now' - 

INTERVAL '1' x))> X stands for the TSQL2 name of the granularity of chronons (e.g. 

DAY), and n is the length of r </> n . Whenever the rule is used, a is a new correlation 

name obtained by calling the correlation names generator. 

The rule for Past[/3,(ft'] calls recursively trans to translate (ft', (ft' is translated with 
respect to A', which represents the intersection of the period of the original A with the 
period that covers all the time up to (but not including) the present chronon. This 
reflects the semantics of Past (section |3.8|) , that narrows It to It D [tfi rs t,st). The 
relation returned by trans (Past [0, (j)'], A) is the same as that of trans((ft' , A'), except 
that the relation of trans(Past{(3, (j)'], A) contains an additional explicit attribute, that 
corresponds to the (3 of Past[(3, (j)']. The values of that attribute are the same as the cor- 
responding time-stamps (that represent et). This reflects the semantics of Past[(3, (j)'], 
that requires the value of to be et. As a further example, At[n, (ft'] (n G CONS) is 
translated using the following: 

Translation rule for At[n,(ft']: 

trans(At[K, (ft'], A) = trans((f>', A'), where A' is INTERSECT (A , h' cons (K)). 

The translation of At[n, (j)'] is the same as the translation of (ft', but (ft' is translated 
with respect to A', which represents the intersection of A's period with that of n. This 
reflects the fact that in At[K,(ft'], the At narrows It to the intersection of the original 
It with k's period. There are separate translation rules for At[a c , (3, (ft'], At[a g , (3, (ft'], 
and Atifa,^] (a c G CPARTS, a g G GPARTS, and (ft',(fti,(ft2 G YNFORMS). 

The complete set of translation rules for yes/no formulae is given in appendix along 



with a formal proof that trans((ft, A) satisfies theorem 5.2. Theorem 5.2 is proven by 
induction on the syntactic complexity of (ft. I first prove that theorem 5.2 holds if (ft is 
a predicate or Cw/to[7t(ti, . . . ,r n )]. For all other (ft G YNFORMS , (ft is non-atomic. In 
those cases, I prove that theorem |5.2| holds if it holds for the subformulae of (ft. 



Let us now consider wh-formulae. These have the form 1(3\ 1(32 ?/?3 • • • iPk (ft' or 
ImxiPi ?/?2 103 ••• 10k ft, wh ere (ft' G YNFORMS (section U)- The first case is 
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covered by the following rule. (The rules for wh-formulae define trans(4>, A) only for 
A = Xinit- The values of trans for <j> E WHFORMS and A 7^ Xi n it are not used anywhere 
and can be chosen arbitrarily. Intuitively, for <j> 6 WHFORMS the goal is to define 



trans(4>, A) so that it satisfies theorem 5.1. That theorem is indifferent to the values 
of trans for A 7^ \i n it ■) 

Translation rule for 10 x ?/3 2 ?/%... ?A </>': 

trans(?/3! ?/? 2 ?/% . . . Wk <t>', Xinit) = f 

(SELECT DISTINCT SNAPSHOT a.Wi , a.w 2 . ■•■ , a.w fc 
FROM trans((j)' , Xmit) AS a) 

Whenever the rule is used, a is a new correlation name, obtained by calling the correla- 
tion names generator. Assuming that r cj) n = (n, . . . , r n ), for every i £ {1,2,3,..., k}: 

(jji = min({j I j e {1,2,3, ... , n} and Tj = 0j}) 

That is, the first position (from left to right) where 0i appears in (n, . . . , r n ) is the Wj-th 
one. Intuitively, we want 10x 102 103 • • • 10k 0' to be translated to a SELECT statement 
that returns a snapshot relation, whose tuples represent ?/?2 • • • ?/3fc ( ^ / || M ( s *)' s *. 
According to section |J, p.f3 x ?/3 2 ?)%... 7/9fc </.'|| M ( st )' si is the set of all tuples that 
represent combinations of values assigned to ...,/?& by some g £ G, such that for 
some et G PERIODS, \\(j)>\\M(st), s t,et,PTS,g = T 



By theorem 5.2 , the relation returned by trans((j)', Xinit) ( see the translation rule) is a 
valid-time relation, whose tuples show all the possible combinations of ets and values 
assigned (by f CO ns(st) and some g E G) to r\, . . . ,r n , for which \\ ( j ) l \\ M ( st )' st > et > PTS >9 = 
T. The syntax of Top (section |3.2|) guarantees that 0i,...,0k appear within 4>' . 
This in turn guarantees that fix,... ,0k appear among ri, ... , r n , i.e. the relation of 
trans{4>' , Xi n it) contains attributes for 0x, ■ ■ ■ , 0k- To find all the possible combinations 
of values of 0x,---,0k for which (for some et) ||<^ / || M ( s *)' si ' ei ' PTS '>9 = T, we simply 
need to pick (to "project" in relational terms) from the relation of trans (<// , Xinit) the 
attributes that correspond to 0x, ■ ■ ■ , 0k- Fo r i G {1, 2, 3, ... , k}, 0i may appear more 
than once in <fi' . In this case, the relation of trans(4>' , Xi n u) contains more than one 
attributes for 0i (these attributes have the same values in each tuple). We only need 
to project one of the attributes that correspond to 0i. The translation rule projects 
only the first one; this is the cjj-th attribute of trans{4>' , Xinit), the attribute that 
corresponds to the first (from left to right) tj in (n, . . . , r n ) that is equal to 0i. 
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Let us consider, for example, the following wh-formula ("Who inspected what?"): 

(5.69) ?wl v ?w2 v Past[e v , Culm[inspecting(occr v ,wl v ,w2 v )]] 

Here, <p' = Past[e v , Culm[inspecting(occr v ,wl v ,w2 v )]] and r </>' n = {e v , occr v , wl v , w2 v ). 
Let us assume that trans(4>' , Xmit) returns ( |5.70| ). ( ^.70[) shows all the possible combi- 
nations of ets and values that can be assigned by some g G G to e v , occr v , wl v , and 
w2 v , such that \\(j)'\\ M ( st )' st ' et ' p TS,g = T. In every tuple, the time-stamp is the same 
as the value of the first explicit attribute, because the semantics of Past requires the 
value of e" (represented by the first explicit attribute) to be et (represented by the 
time-stamp). To save space, I omit the time-stamps of ( |5.70| ). 

(5.70) 



[9:00am 1/5/95, 3:00pm 1/5/95] 


il58 


J.Adams 


UKim 




[10:00am 4/5/95, 11:30am 4/5/95] 


H65 


J.Adams 


BA737 




[7:00am 16/11/95, 7:30am 16/11/95] 


i204 


T.Smith 


£7X160 





To generate the snapshot relation that represents ||?wl 1 ' ?w2 v ^'|| M ( si )' s * ) i.e. the rela- 
tion that shows the combinations of values of wV and w2 v for which (for some et and 
g) \\Past[e v , Culm[inspecting(occr v ,wl v ,w2 v )]]\\ M ( st }' st ' et ' PTS ' 9 = T, we simply need 
to project the explicit attributes of ( 5.7CQ that correspond to wV and w2 v . The first 
positions where wl v and w2 v appear in r (p > ~ 1 = (e v , occr v , wl v , w2 v ) are the third and 
fourth (i.e. uj% = 3 and 002 = 4). Hence, we need to project the third and fourth ex- 
plicit attributes of ( |5.70| ). The translation rule for . . . If5 k <p' maps Q5.69 ) to ( 5.71| ), 
which achieves exactly that (it returns ( |5.72| )). 



(5.71) 
(5.72) 



(SELECT DISTINCT SNAPSHOT tl.3, tl.4 
FROM trans(Past[e v , Culm[inspecting(occr v , wV , w2 v )\\, Xinit) 



AS tl) 



J.Adams 


£7X160 


J.Adams 


BATH 


T.Smith 


[7X160 



Wh-formulae of the form l mxi Pi ?/3 2 ?As • • • ?/?fc 4>' G YNFORMS) are translated 
using the following: 

Translation rule for ? mx ; 
trans(l mx if3i ?/3 2 ?/% • • • ?/?fc 4>', Knit 



def 



(SELECT DISTINCT SNAPSHOT VALID (a 2 ) , a 2 .2, a 2 .3, ... , a 2 ■ k 
FROM (SELECT DISTINCT 'dummy', cti.uj 2 , ai.u> 3 , ... , ai.LU k 
VALID ai.Wi 

FROM trans((j)' ,\ im t) AS ai 
) (N0SUBPERI0D) AS a 2 ) 
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Whenever the rule is used, a\ and cti are two different new correlation names, ob- 
tained by calling the correlation names generator. Assuming that — • • • 
uji, . . . , ook are as in the rule for 1(3% . . . ?/3j~ <f>. That is, the first position (from left to 
right) where /3j appears in (n, . . . , r n ) is the Wj-th one. 



Let us consider, for example, ( |5.73| ) ("What circled when. 7 '). 
(5.73) 7 mxl e v ?w v Past[e v ,circling(w v )] 

Let us also assume that trans(Past[e v , cir cling (w v )], \i n it) returns ( 5.74 ). In this case, 
(f)' = Past[e v , circling(w v )] and r </> n = {e v ,w v ). ( |5.74| ) shows all the combinations of 
ets and values of e v and w v , for which the denotation of Past[e v , cir cling {w v )] is T. In 
each tuple, the value of the first explicit attribute (that corresponds to e v ) is the same 
as the time-stamp, because the semantics of Past requires the value of e v to be the 
same as et (represented by the time-stamp). To save space, I omit the time-stamps. 

(5.74) 



[5:02pm 22/11/95, 5:17pm 22/11/95] 
[5:05pm 22/11/95, 5:15pm 22/11/95] 
[5:07pm 22/11/95, 5:13pm 22/11/95] 

[4:57pm 23/11/95, 5:08pm 23/11/95] 
[4:59pm 23/11/95, 5:06pm 23/11/95] 
[5:01pm 23/11/95, 5:04pm 23/11/95] 



[8:07am 22/11/95, 8: 
[8:08am 22/11/95, 8: 
[8:09am 22/11/95, 8: 



19am 22/11/95] 
12am 22/11/95] 
10am 22/11/95] 



BA737 
BA737 
BA737 

BA737 
BA737 
BA737 

[7X160 
[7X160 
[7X160 



BA737 was circling from 5:02pm to 5:17pm on 22/11/95, and from 4:57pm to 5:08pm 
on 23/11/95. UK160 was circling from 8:07am to 8:19am on 22/11/95. (j5774j) also 
contains tuples for the subperiods of these periods, because circling(w v ) (like all Top 
predicates) is homogeneous (section [31]). Past[e v ,circling(w v )] is true at all these 
subperiods that end before the present chronon. 

In our example, the embedded SELECT statement of trans(? mx i(3i ?/?2 ?/% • • • ?/3yfc <j>' , \nit) 
is: 

(5.75) (SELECT DISTINCT 'dummy', tl.2 
VALID tl.l 

FROM trans(Past[e v ,circling(w v )],X im t) AS tl) 



(|5.75j ) generates ( |5.76 ), where the time-stamps are the values of the first explicit at- 
tribute of ( 5.74| ) (i.e. they correspond to e v ). The 'dummy' in the embedded SELECT 
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statement (( |5.75| ) in our example) means that the first explicit attribute of that state- 
ment's resulting relation should have the string "dummy" as its value in all tuples. This 
is needed when k = 1. If, for example, fl5.73| ) were ? m xie v Past[e v , circling (6a737)], 
without the 'dummy' the SELECT clause of ( |5.75| ) would contain nothing after DISTINCT 
(this is not allowed in Tsql2). 



(5.76) 



dummy 


BA1Z1 


[5 


02pm 22/11/95, 5:17pm 22/11/95] 


dummy 


BA737 


[5 


05pm 22/11/95, 5:15pm 22/11/95] 


dummy 


BA737 


[5 


07pm 22/11/95, 5:13pm 22/11/95] 


dummy 


BA737 


[4 


57pm 23/11/95, 5:08pm 23/11/95] 


dummy 


BA737 


[4 


59pm 23/11/95, 5:06pm 23/11/95] 


dummy 


BA737 


[4 


59pm 23/11/95, 5:06pm 23/11/95] 


dummy 


UK160 


[8 


07am 22/11/95, 8:19am 22/11/95] 


dummy 


UK160 


[8 


08am 22/11/95, 8:12am 22/11/95] 


dummy 


UK160 


[8 


09am 22/11/95, 8:10am 22/11/95] 



The (N0SUBPERI0D) of the translation rule removes from ( 5.76| ) any tuples that do not 
correspond to maximal periods. That is (5.76) becomes ( 5.77| ). 



(5.77) 



dummy 
dummy 
dummy 



BA737 
BA737 
UK160 



[5:02pm 22/11/95, 5:17pm 22/11/95] 
[4:57pm 23/11/95, 5:08pm 23/11/95] 
[8:07am 22/11/95, 8:19am 22/11/95] 



The overall ( 5.73 ) is mapped to ( |5.78 ), which generates ( |5.79| ). ( 5.7S| ) represents the 
denotation of ( |5.73[ ) w.r.t. M(st) and st (pairs of maximal circling periods and the 
corresponding flights). 



(5.78) 



(5.79) 



(SELECT DISTINCT SNAPSHOT VALID (t2) , t2.2 
FROM (SELECT DISTINCT 'dummy', tl.2 
VALID tl.l 

FROM trans(Past[e v , circling(w v )], A imf ) AS tl 
) (N0SUBPERI0D) AS t2) 



[5:02pm 22/11/95, 5:17pm 22/11/95] 
[4:57pm 23/11/95, 5:08pm 23/11/95] 
[8:07am 22/11/95, 8:19am 22/11/95] 



BA7Z7 
BA737 
UK160 



Appendix ^ proves that the translation rules for wh-formulae satisfy theorem |5,1| , 
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(SELECT DISTINCT SNAPSHOT t4.3 

FROM (SELECT DISTINCT VALID (t3) , t3.1, t3.2, t3.3 
VALID VALID (t3) 

FROM (SELECT DISTINCT tl.l, tl.2, tl.3 

VALID PERIOD(BEGIN(VALID(tl) ) , END ( VALID (t 1) ) ) 
FROM (SELECT DISTINCT insp.l, insp.2, insp.3 
VALID VALID (insp) 

FROM inspections (PERIOD) AS insp) (ELEMENT) AS tl, 
(SELECT DISTINCT SNAPSHOT inspcmpl . 1 , inspcmpl.2, inspcmpl.3 
FROM inspections AS inspcmpl 
WHERE inspcmpl. 4 = 'complete') AS t2 
WHERE tl.l = t2.1 AND tl.2 = t2.2 
AND tl.3 = t2.3 AND tl.3 = 'UK160' 
AND INTERSECT ( 
INTERSECT ( 

PERIOD (TIMESTAMP 'beginning', TIMESTAMP 'forever'), 
PERIOD 'today' - INTERVAL '1' DAY), 
PERIOD (TIMESTAMP 'beginning', 

TIMESTAMP 'now' - INTERVAL '1' MINUTE)) 
CONTAINS PERIOD(BEGIN(VALID(tl) ) , END ( VALID (t 1) ) ) 

) AS t3 
) AS t4) 

Figure 5.3: Example of generated Tsql2 code 



5.12 Optimising the generated TSQL2 code 

The generated Tsql2 code is often verbose. There are usually ways in which it could 



be shortened and still return the same results. Figure 5.3, for example, shows the code 



that is generated by the translation of ( 5.8C| ), if chronons correspond to minutes. (( 5.8C| ) 



expresses the reading of "Who inspected UK160 yesterday?" where the inspection must 
have both started and been completed on the previous day.) 

(5.80) ?w v At[y ester day, Past[e" , Culm{inspecting(occr v ,w v ,ukl60)}]] 



I assume here that h' p j: uns (inspecting,3) and h' culms (inspecting,3) are ( 5.45 ) and ( 5.46 ) 



respectively. The embedded SELECT statements of figure |5.3| that are associated with tl 
and t2 are ( 5.45[) and (|5.46| ). The embedded SELECT statement that is associated with t3 



corresponds to Culm[inspecting(occr v ,w v ,uklQ0)] (see the rule for Culm[ir(Ti, . . . , r ra )] 
in section 5.1l| ) . It generates a relation whose explicit attributes show all the combina- 



tions of codes, inspectors, and inspected objects that correspond to complete inspec- 
tions. The time-stamps of this relation represent periods that cover whole inspections 
(from start to completion). The last constraint in the WHERE clause (the one with 
CONTAINS) admits only tuples whose time-stamps (whole inspections) are subperiods 
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of It. The two nested INTERSECTS before CONTAINS represent It. The At has narrowed 
It to the intersection of its original value (whole time-axis) with the previous day 
(PERIOD 'today' - INTERVAL '1' DAY)). The Past has narrowed It further to the inter- 
section with [tfi rs t,st) (PERIOD (TIMESTAMP 'beginning', TIMESTAMP 'now' - INTERVAL 
'1' MINUTE)). 

The embedded SELECT statement that is associated with t4 is generated by the trans- 
lation rule for Past[(3,4>'] (section |5.1l| ). It returns the same relation as the statement 
that is associated with t3, except that the relation of t4's statement has an addi- 
tional explicit attribute that corresponds to the first argument of Past. In each tuple, 
the value of this extra attribute is the same as the time-stamp (et). The topmost 
SELECT clause projects only the third explicit attribute of the relation returned by t4's 
statement (this attribute corresponds to w v of (|5.80Q ). 



The code of figure could be shortened in several ways. t4's statement, for example, 
simply adds an extra attribute for the first argument of Past. In this particular case, 
this extra attribute is not used, because ( ^.80 ) contains no interrogative quantifier 
for the first argument of Past. Hence, t4's statement could be replaced by t3's (the 
topmost SELECT clause would have to become SELECT DISTINCT SNAPSHOT t3.2). One 
could also drop the top-level SELECT statement, and replace the SELECT clause of t3's 
statement with SELECT DISTINCT SNAPSHOT tl.2. Furthermore, the intersection of the 
whole time-axis (PERIOD (TIMESTAMP 'beginning', TIMESTAMP 'forever')) with any pe- 
riod p is simply p. Hence, the second INTERSECT ( . . . , ... ) could be replaced by its 



second argument. The resulting code is shown in figure 5.4. Further simplifications 
are possible. 

Most Dbmss employ optimisation techniques. A commercial Dbms supporting Tsql2 
would probably be able to carry out at least some of the above simplifications. Hence, 
the reader may wonder why should the Nlitdb attempt to optimise the Tsql2 code, 
rather than delegate the optimisation to the DBMS. First, as mentioned in section 



1.2.4, only a prototype Dbms currently supports Tsql2. Full-scale Tsql2 Dbmss with 



optimisers may not appear in the near future. Second, long database language queries 
(like the ones generated by the framework of this thesis) can often confuse generic Dbms 
optimisers, causing them to produce inefficient code. Hence, shortening the Tsql2 
code before submitting it to the DBMS is again important. It would be interesting 
to examine if optimisations like the ones discussed above could be automated, and 
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(SELECT DISTINCT SNAPSHOT tl.2 
FROM (SELECT DISTINCT insp.l, insp.2, insp.3 
VALID VALID(insp) 

FROM inspections (PERIOD) AS insp) (ELEMENT) AS tl, 
(SELECT SNAPSHOT inspcmpl . 1 , inspcmpl.2, inspcmpl.3 
FROM inspections AS inspcmpl 
WHERE inspcmpl. 4 = 'complete') AS t2 
WHERE tl.l = t2.1 AND tl.2 = t2.2 
AND tl.3 = t2.3 AND tl.3 = 'UK160' 
AND INTERSECT (PERIOD 'today' - INTERVAL '1' DAY, 
PERIOD (TIMESTAMP 'beginning', 

TIMESTAMP 'now' - INTERVAL '1' MINUTE)) 
CONTAINS PERIOD(BEGIN(VALID(tl)) , END (VALID (t 1) ) ) 

Figure 5.4: Shortened Tsql2 code 



integrated into the framework of this thesis as an additional layer between the Top to 
Tsql2 translator and the Dbms. I have not explored this issue. 



5.13 Related work 



Various mappings from forms of logic to and from relational algebra (e.g. [Unman 88], 



[ Van Gelder fc Topor 91| ), from logic programming languages to SQL (e.g. [ Lucas 



Ipraxler 92| 1), and from logic formulae generated by Nlidbs to SQL ( fLowden et al. 91a |, 
|Androutsopoulos 92 ], | Androutsopoulos et al. 95 ], | Rayner 93[| ) have been discussed in 
the past. The mapping which is most relevant to the Top to Tsql2 translation of this 
chapter is that of [ Boehlen et al. 96fl . 

Boehlen et al. study the relation between Tsql2 and an extended version of first 
order predicate logic (henceforth called Sul), that provides the additional temporal 
operators • (previous), o (next), since, and until. Sul is point-based, in the sense 
that Sul formulae are evaluated with respect to single time-points. Sul assumes that 
time is discrete. Roughly speaking, »(f) is true at a time-point t iff (j) is true at the 
time-point immediately before t. Similarly, o<p is true at t iff <j) is true at the time-point 
immediately after t. 4>\ since (j>2 is true at t iff there is some t' before t, such that (f>2 
is true at t' , and for every t" between t' and t, (f>\ is true at t" . Similarly, 4>i until <p2 
is true at t iff there is some t' after t, such that (f>2 is true at t' , and for every t" 
between t and t', <p\ is true at t" . Various other operators are also defined, but these 
are all definable in terms of •, o, since, and until. For example, 04> is equivalent to 
true since 4> {true is a special formula that is true at all time-points). In effect, is 
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true at t if there is a t' before t, and (ft is true at t'. For example, (5.81) and ( |5.83 ) can 
be expressed as ( 5.82 ) and ( 5.84| ) respectively. 



(5.81) BA737 departed (at some past time). 

(5.82) <)depart(ba737) 

(5.83) Tank 2 has been empty (all the time) since BA737 departed. 

(5.84) empty (tank2) since depart(ba737) 

Boehlen et al. provide rules that translate from Sul to Tsql2. (They also show how to 
translate from a fragment of Tsql2 back to Sul, but this direction is irrelevant here.) 
The underlying ideas are very similar to those of this chapter. Roughly speaking, there 
are non-recursive rules for atomic formulae, and recursive rules for non-atomic formu- 
lae. For example, the translation rule for (pi since </>2 calls recursively the translation 
algorithm to translate <j)\ and (f>2- The result is a SELECT statement, that contains two 
embedded SELECT statements corresponding to 4>\ and fo- Devising rules to map from 
Sul to Tsql2 is much easier than in the case of Top, mainly because Sul formulae are 
evaluated with respect to only one time-parameter (Top formulae are evaluated with 
respect to three parameters, st, et, and It), Sul is point-based (Top is period-based; 
section |3.1| ), and Sul provides only four temporal operators whose semantics are very 
simple (the Top version of this chapter has eleven operators, whose semantics are 
more complex). Consequently, proving the correctness of the Sul to Tsql2 mapping 
is much simpler than in the case of Top. 

It has to be stressed, however, that Top and Sul were designed for very different 
purposes. Sul is interesting from a theoretical temporal-logic point of view. Roughly 
speaking, it has been proven that whatever can be expressed in traditional first-order 
predicate logic with a temporal precedence connective by treating time as an extra 
predicate argument (e.g. ( |1.12 ) of page|6|) can also be expressed in first-order predicate 



logic enhanced with only a since and an until operator, subject to some continuity 



conditions (the reverse is not true; see chapter II. 2 of [ vanBenthem 83 |). The mapping 
from Sul to Tsql2 (and the reverse mapping from a fragment of Tsql2 to Sul) is 
part of a study of the relative expressiveness of Sul and Tsql2. The existence of a 
mapping from Sul to Tsql2 shows that Tsql2 is at least as expressive as Sul. (The 
reverse is not true. Full Tsql2 is more expressive than Sul; see jBoehlen et al. 96| .) 



In contrast, Top was not designed to study expressiveness issues, but to facilitate 
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the mapping from (a fragment of) English to logical form. Chapter || showed how to 
translate systematically from a non-trivial fragment of English temporal questions into 
Top. No such systematic translation has been shown to exist in the case of Sul, and 
it is not at all obvious how temporal English questions (e.g. containing progressive and 
perfect tenses, temporal adverbials, temporal subordinate clauses) could be mapped 
systematically to appropriate Sul formulae. 

Although the study of expressiveness issues is not a goal of this thesis, I note that the 
Top to Tsql2 translation of this chapter implies that Tsql2 is at least as expressive as 
Top (every Top formula can be mapped to an appropriate Tsql2 query). The reverse 
is not true: it is easy to think of Tsql2 queries (e.g. queries that report cardinalities 
of sets) that cannot be expressed in (the current version of) Top. Finally, neither 
Top nor Sul can be said to be more expressive than the other, as there are English 
sentences that can be expressed in Sul but not Top, and vice-versa. For example, 
the Sul formula ( |5.84| ) expresses ( |5.83D , a sentence that cannot be expressed in Top. 
Also, the Top formula ( |5.86| ) expresses ( |5.85 ). There does not seem to be any way to 
express ( 5.85| ) in Sul. 



(5.85) Tank 2 was empty for two hours. 

(5.86) For[hour c , 2, Past[e v , empty (tank2)]] 

5.14 Summary 

Tsql2 is an extension of Sql-92 that provides special facilities for manipulating tem- 
poral information. Some modifications of Tsql2 were adopted in this chapter. Some 
of these are minor, and were introduced to bypass uninteresting details (e.g. refer- 
ring to explicit attributes by number) or obscure points in the Tsql2 definition (e.g. 
the new version of the INTERVAL function). Other modifications are more significant, 
and were introduced to facilitate the Top to Tsql2 translation (e.g. (SUBPERIDD) and 
(NOSUBPERIOD)). One of these more significant modifications (calendric relations) is 
generally useful. Some minor modifications of Top were also adopted in this chapter. 

A method to translate from Top to Tsql2 was framed. Each Top formula <p is 
mapped to a Tsql2 query. This is executed by the Dbms, generating a relation that 
represents (via an interpretation function) ||(/>|| M ( S *)' S *. Before the translation method 
can be used, the configurer of the Nlitdb must specify some functions (h' cons , h' p j uns , 
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Kuims' ^-'gpartsi h'cparts) that nn k certain basic Top expressions to Tsql2 expressions. 
The Top to Tsql2 translation is then carried out by a set of translation rules. The 
rules have to satisfy two theorems ( |5.1| and |5.2j ) for the translation to be correct (i.e. 
for the Tsql2 query to generate a relation that represents [|</>[| A ^ s *' ,si ). An informal 
description of the functionality of some of the rules was given. The full set of the 



translation rules, along with a proof that they satisfy theorems |5.1| and is given in 
appendix |A|. Further work could explore how to optimise the generated Tsql2 code. 

The Top to Tsql2 translation is in principle similar to the Sul to Tsql2 translation of 



| Boehlen et al. 96 1. Top and Sul, however, were designed for very different purposes, 
and the Sul to Tsql2 translation is much simpler than the Top to Tsql2 one. 



Chapter 6 



The prototype NLITDB 



Time works wonders. 



6.1 Introduction 



This chapter discusses the architecture of the prototype Nlitdb, provides some in- 
formation on how the modules of the system were implemented, and explains which 
modules would have to be added if the prototype Nlitdb were to be used in real-life 
applications. A description of the hypothetical airport database is also given, followed 
by sample questions from the airport domain and the corresponding output of the 
Nlitdb. The chapter ends with information on the speed of the system. 



6.2 Architecture of the prototype NLITDB 



Figure shows the architecture of the prototype Nlitdb. Each English question is 
first parsed using the Hpsg grammar of chapter generating an Hpsg sign. Mul- 
tiple signs are generated for questions that the parser understands to be ambiguous. 
A Top formula is then extracted from each sign, as discussed in section ^1]. Each 
extracted formula subsequently undergoes the post-processing of section 4.17| . (The 
post-processor also converts the formulae from the Top version of chapters || and |] 
to the version of [|; see section 5.5.) As discussed in section 4.17 , the post-processing 
sometimes generates multiple formulae from the same original formula. 

Each one of the formulae that are generated at the end of the post-processing captures 
what the Nlitdb understands to be a possible reading of the English question. Many 
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lexical entries 
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V 



domain-dependent 



TSQL2 query 

Figure 6.1: Architecture of the prototype NLITDB 



fully-Hedged Nlidbs use preference measures to guess which reading among the possi- 
ble ones the user had in mind, or generate "unambiguous" English paraphrases of the 
possible readings, asking the user to select one (see HAlshawi 92fl , [|Alshawi et al. 92f , 
jDc Roeck fc Lowden 86 ] and | Lowden fc De Roeck 86j| ) . No such mechanism is cur- 
rently present in the prototype Nlitdb. All the formulae that are generated at the end 
of the post-processing are translated into Tsql2. The Nlitdb prints all the resulting 
Tsql2 queries along with the corresponding Top formulae. The Tsql2 queries would 
be executed by the Dbms to retrieve the information requested by the user. As men- 



tioned in section |1.2.4| , however, the prototype Nlitdb has not been linked to a Dbms. 
Hence, the Tsql2 queries are currently not executed, and no answers are produced. 

The following sections provide more information about the grammar and parser, the 
module that extracts Top formulae from Hpsg signs, the post-processor, and the Top 
to TSQL2 translator. 



6.3 The grammar and parser 



The Hpsg version of chapter |I| was coded in the formalism of Ale Carpenter 92 1 
||Carpcntcr fc Penn 94 ], building on previous Ale encodings of Hpsg fragments by 
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Gerald Penn, Bob Carpenter, Suresh Manandhar, and Claire Grover.Q Ale can be 
thought of as a grammar-development environment. It provides a chart parser (which 



is the one used in the prototype Nlitdb; sec [Gazdar &; Mellish 89] for an introduction 
to chart parsers) and a formalism that can be used to write unification-grammars based 
on feature structures. 

Coding the Hpsg version of chapter ||] in Ale's formalism proved straight-forward. 
Ale's formalism allows one to specify grammar rules, definite constraints (these are 
similar to Prolog rules, except that predicate arguments are feature structures), lexical 
entries, lexical rules, and a hierarchy of sorts of feature structures. The schemata and 
principles of the Hpsg version of chapter || were coded using Ale grammar rules and 
definite constraints. Ale's lexical entries, lexical rules, and sort hierarchy were used 
to code Hpsg's lexical signs, lexical rules, and sort hierarchy respectively. 

The Ale grammar rules and definite constraints that encode the Hpsg schemata and 
principles are domain- independent, i.e. they require no modifications when the Nl- 
itdb is configured for a new application domain. The lexical rules of the prototype 
Nlitdb are also intended to be domain-independent, though their morphology parts 
need to be extended (e.g. more information about the morphology of irregular verbs 
is needed) before they can be used in arbitrary application domains. The lexical en- 
tries of the system that correspond to determiners (e.g. "a", "some"), auxiliary verbs, 
interrogative words (e.g. "who", "when"), prepositions, temporal subordinators (e.g. 
"while", "before"), month names, day names, etc. (e.g. "January" , "Monday") are also 
domain- independent. The person configuring the NLITDB, however, needs to provide 
lexical entries for the nouns, adjectives, and (non-auxiliary) verbs that are used in 
the particular application domain (e.g. "Eight", "open", "to land"). The largest part 
of the Nlitdb's sort hierarchy is also domain independent. Two parts of it need to 
be modified when the system is configured for a new domain: the hierarchy of world 



entities that is mounted under ind (section [4.4| and figure 4.4 on page |126|), and the 



subsorts of predicate that correspond to Top predicates used in the domain (section 



4.3 and figure 4.1 on page |122|). As will be discussed in section 6.6, tools could be 



added to help the configurer modify the domain-dependent modules. 



The prototype Nlitdb was implemented using Ale version 2.0.2 and Sicstus Prolog ver- 
sion 2.1.9. Chris Brew provided additional Ale code for displaying feature structures. 
The software of the prototype Nlitdb, including the Ale grammar, is available from 
http://www.dai.ed.ac.uk/groups/nlp/NLP_home_page.html. An earlier version of the proto 



type Nlitdb wa s implemented using the Hpsg-Pl and Pleuk systems |Popowich & Vogel 91] 



Calder et aL 94 1 
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6.4 The extractor of TOP formulae and the post-processor 



The module that extracts Top formulae from Hpsg signs actually generates Top 
formulae written as Prolog terms. For example, it generates ( |6.2[) instead of (|6.l| ). The 
correspondence between the two notations should be obvious. The Prolog-like notation 
of ( |6.2[) is also used in the formulae that are passed to the Top to Tsql2 translator, 
and in the output of the Nlitdb (see section |6.8| below). 

(6.1) ?xl v Ntense[x2> v ,president{xV)] A Past[x2 v , located-at(xl v , terminate)] 

(6.2) interrog(xl~v, and(ntense(x3~v, president (xl~v) ) , 

past(x2~v, located_at(xl~v, terminal2) ) ) ) 

The extractor of the Top formulae is implemented using Prolog rules and Ale defi- 
nite constraints (Prolog- like rules whose predicate-arguments are feature structures). 
Although the functionality of the extractor's code is simple, the code itself is rather 
complicated (it has to manipulate the internal data structures that Ale uses to repre- 
sent feature structures) and will not be discussed. 



As mentioned in section 6.2, the post-processor of figure 6.1 implements the post 



processing phase of section [4.17 . The post-processor also eliminates Part operators 



by merging them with the corresponding At, Before, or After operators, to convert 
the formulae into the Top version of the Top to Tsql2 translator (section |5.5[ ). The 
post-processor's code, which is written in Prolog, presents no particular interest and 
will not be discussed. 

6.5 The TOP to TSQL2 translator 

Implementing in Prolog the Top to Tsql2 mapping of chapter |B| proved easy. The code 



of the Top to Tsql2 translator of figure 6.1 is basically a collection of Prolog rules for 



the predicate trans. Each one of these rules implements one of the translation rules 
of section 5.11 and appendix |A[ For example, the following Prolog rule implements 



the translation rule for Past\(3,(j)'] (p. 239). (I omit some uninteresting details of the 
actual Prolog rule.) 

trans (past (_"v, PhiPrime) , Lambda, Sigma) :- 
chronons (Chronon) , 
multiappend( [ 
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" INTERSECT (" , Lambda, ", ", 

"PERIOD (TIMESTAMP 'beginning', ", 

"TIMESTAMP 'now' - INTERVAL '1' ", Chronon, ")) 

] , LambdaPrime) , 
trans (PhiPrime , LambdaPrime, SigmaPrime) , 
new.cn (Alpha) , 
corners (PhiPrime , CList) , 
length (CList, N) , 

generate_select_list (Alpha, N, SelectList) , 
multiappend( [ 

"(SELECT DISTINCT VALID(", Alpha, "), ", SelectList, 
"VALID VALID(", Alpha, ")", 
"FROM ", SigmaPrime, " AS ", Alpha, ")", 
] , Sigma) . 



The first argument of trans is the Top formula to be translated (in the notation 
of ( |6.2| )). Lambda is a string standing for the A argument of the trans function of 
section |5~10| (initially "PERIOD (TIMESTAMP 'beginning', TIMESTAMP 'forever')"). 
The generated Tsql2 code is returned as a string in Sigma. 

The chronons (Chronon) causes Chronon to become a string holding the TSQL2 name 
of the granularity of chronons (e.g. "MINUTE"). The chronons predicate is supplied by 
the configurer of the Nlitdb, along with Prolog predicates that define the h! functions 
of section |5.9| . For example, the following predicate defines h' p j: uns (inspecting,3) to be 
the SELECT statement of ( |5.45 ) on page [228| . The chronons predicate and the predicates 
that define the h! functions are the only domain-dependent parts of the Top to Tsql2 
translator. 



h_prime_pfuns_map (inspecting, 3, 

["SELECT DISTINCT insp.l, insp.2, insp.3", 
"VALID VALID (insp)" , 

"FROM inspections (PERIOD) AS insp"]). 



The first mult i append in the trans rule above generates the A' string of the trans- 
lation rule for Past[[3, 4>'] (p. |239| ). It concatenates the string-elements of the list 
provided as first argument to multiappend, and the resulting string (A') is returned 
in LambdaPrime. As in the translation rule for Past[f3, cj>'], the translation mapping is 
then invoked recursively to translate (f)' (PhiPrime). The result of this translation is 
stored in SigmaPrime. 

new_cn (Alpha) returns in Alpha a string holding a new correlation name (new_cn im- 
plements the correlation names generator of section |5.11| ). The corners (PhiPrime , 
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Figure 6.2: Extended architecture of the prototype NLITDB 



CList) causes CList to become r 4>'~ 1 , and length (CList , N) returns in N the length of 
r (p'~ l . The generate_select_list (Alpha, N, SelectList) returns in SelectList a 
string of the form Alpha . 1 , Alpha. 2, ... , Alpha. N. Finally, the second mult iappend 
returns in Sigma a string that holds the overall TSQL2 code. 



6.6 Modules to be added 



The prototype Nlitdb is intended to demonstrate that the mappings from English to 
Top and from Top to Tsql2 are implement able. Consequently, the architecture of 
the prototype Nlitdb is minimal. Several modules, to be sketched in the following 
sections, would have to be added if the system were to be used in real-life applications. 



Figure p.2| shows how these modules would fit into the existing system architecture. 



CHAPTER 6. THE PROTOTYPE NLITDB 

(Modules drawn with dashed lines are currently not present.) 



256 



6.6.1 Preprocessor 

The Ale parser requires its input sentence to by provided as a Prolog list of symbols 
(e.g. Q). 

(6.3) [was , ba737 , circling , at , pm5_00] 



As there is essentially no interface between the user and the Ale parser in the prototype 
Nlitdb, English questions have to be typed in this form. This does not allow words to 
start with capital letters or numbers, or to contain characters like "/" and " : " (Prolog 
symbols cannot contain these characters and must start with lower case letters). To 
bypass these constraints, proper names, dates, and times currently need to be typed in 
unnatural formats (e.g. "london", "dl_5_92", "pm5_00", "yl991" instead of "London", 
"1/5/92" "5:00pm", "1991"). 

A preprocessing module is needed, that would allow English questions to be typed in 
more natural formats (e.g. fl6.4j) ), and would transform the questions into the format 
required by the parser (e.g. fl6.3|) ). 

(6.4) Was BA737 circling at 5:00pm? 



Similar preprocessing modules are used in several natural language front-ends (e.g. Cle 



[ Alshawi 92 1 and Masque [ Lindop 86 ]). These modules typically also merge parts of 
the input sentence that need to be processed as single words. For example, the lexicon 
of the airport domain has a single lexical entry for "gate 2". The preprocessor would 
merge the two words of "gate 2" in (|6.5| ), generating (|6.6|) . (Currently, "gate 2" has to 
be typed as a single word.) 

(6.5) Which flights departed from gate 2 yesterday? 

(6.6) [which , flights , departed , f rom , gate2 , yesterday] 



The preprocessing modules typically also handle proper names that cannot be included 
in the lexicon because they are too many, or because they are not known when creating 
the lexicon. In a large airport, for example, there would be hundreds of flight names 
("BA737" , "UK1751", etc.). Having a different lexical entry for each flight name is 
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impractical, as it would require hundreds of entries to be added into the lexicon. Also, 
new flights (and hence flight names) are created occasionally, which means that the 
lexicon would have to be updated whenever a new flight is created. Instead, the lexicon 
could contain entries for a small number of pseudo-flight names (e.g. "flight-namel" , 
"Hight_name2" , . . . , "Hight-nameN" '; N is the maximum number of flight names that 
may appear in a question, e.g. 5). Each one of these lexical entries would map a 
pseudo-flight name to a Top constant (e.g. flightl, flight2, . . . , flightN).^ The 
preprocessor would use domain-dependent formatting conventions to identify flight 
names in the English question (e.g. that any word that starts with two or three capital 
letters and is followed by three or four digits is a flight name). Each flight name in the 
question would be replaced by a pseudo-flight name. For example, the preprocessor 
would turn (|6.7| ) into (|6.8|) . 



(6.7) Did BA737 depart before UK160 started to land? 

(6.8) [did , flight _namel , depart , before , flight _name2 , started , to , land] 



would then be parsed, giving rise to (|6.9|) (flightl and flightl are Top constants 
introduced by the lexical entries of "Aight_namel" and u flight-name2") . An extra step 



would be added to the post-processing phase of section |4.17| , to substitute flightl and 
flight2 with Top constants that reflect the original flight names. For example, the 
preprocessor could pass to the post-processor the original flight names ("BA737" and 
"UK160") , and the post-processor could replace flightl and flight2 by the original 
flight names in lower case. This would cause fl6.9|) to become ( |6.10| ). Similar problems 
arise in the case of dates, times, numbers, etc. 

(6.9) Before[Past[el v , Begin[landing(flight2)]},Past[e2 v ,depart(flightl)]\ 

(6.10) Befon[Past[er,Begin[landing(ukl6Q)]],Past[e2 v ,depart(ba737)]} 

No preprocessing mechanism is currently present in the prototype Nlitdb. The lexicon 
contains (for demonstration purposes) only a few entries for particular (not pseudo-) 
flight names, times, dates, and numbers (e.g. "BA737", "9:00am"). For example, there 
is no entry for "9:10am". This causes the parsing of "Which tanks were empty at 
9:10am?" to fail. In contrast the parsing of "Which tanks were empty at 9:00am?" 
succeeds, because there is a lexical entry for "9:00am". 

In the Hpsg grammar of chapter |i|, these constants would be represented using sorts like flig htl, 
flight2, . . . , fliqhtN, which would be daughters of flight_ent and sisters of flight_ent-var in figure 4.4 



_> 



on page 126 
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When both words that introduce existential quantification (e.g. "a", "some") and words 
that introduce universal quantification (e.g. "every", "each") are allowed, it is often 
difficult to decide which quantifiers should be given scope over which other quantifiers. 
For example, ( |6.f l| ) has two possible readings. Ignoring the temporal information of 
( |6.11 ), these readings would be expressed in the traditional first-order predicate logic 



(Fopl) using formulae like (|6.12 ) and fl6.13|) 



(6.11) A guard inspected every gate. 

(6.12) 3x (guard(x) A Vy (gate(y) — > inspect(x , y))) 

(6.13) Vy (gate(y) — ► 3x (guard(x) A inspect(x , y))) 



In ( 6. 12| ) , the existential quantifier (introduced by "a") is given wider scope over the 



universal one (introduced by "every") . According to ( |6 . 1 2|) , all the gates were inspected 
by the same guard. In contrast, in ( |6.13 ) where the universal quantifier is given wider 



scope over the existential one, each gate was inspected by a possibly different guard. 



In ( |6.11| ), both scopings seem possible (at least in the absence of previous context). 
In many cases, however, one of the possible scopings is the preferred one, and there 
are heuristics to determine this scoping (see chapter 8 of [ [Alshawi 92| ]). For example, 
universal quantifiers introduced by "each" tend to have wider scope over existential 
quantifiers (e.g. if the "every" of ( |6.11| ) is replaced by "each", the scoping of (6.13) 
becomes more likely than that of ( |6.12| )). 

In this thesis, words that introduce universal quantifiers were deliberately excluded 
from the linguistic coverage (section |2.13). This leaves only existential quantification 



and by-passes the quantifier scoping problem, because if all quantifiers are existen- 
tial ones, the relative scoping of the quantifiers does not matter. (In Top, existen- 
tial quantification is expressed using free variables. There are also interrogative and 
interrogative-maximal quantifiers, but these are in effect existential quantifiers that 
have the additional side-effect of including the values of their variables in the answer.) 

If the linguistic coverage of the prototype Nlitdb were to be extended to support words 
that introduce universal quantification, an additional scoping module would have to 
be added (figure |6.2| ). The input to that module would be an "under specified" Top 



formula (see the discussion in chapter 2 of [ Alshawi 92| ), a formula that would not 
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specify the exact scope of each quantifier. In a FOPL-Iike formalism, an under specified 
formula for ( |6.11| ) could look like Q6.14 ). 



(6.14) inspect((3x guard(x)), (Vy gate(y))) 

The scoping module would generate all the possible scopings, and determine the most 
probable ones, producing formulae where the scope of each quantifier is explicit (e.g. 
flyj) or (glf)). Alternatively, one could attempt to use the the Hpsg quantifier 



scoping mechanism (see [ Pollard fc Sag 94 ] and relevant comments in section 4.3 ) to 



reason about the possible scopings during the parsing. That mechanism, however, is a 
not yet fully developed part of Hpsg. 



6.6.3 Anaphora resolution 



As discussed in sections |0| and 2.12| , nominal anaphora (e.g. "she", "his salary") and 



most cases of temporal anaphora (e.g. "in January", tense anaphora) are currently 
not supported. An anaphora resolution module would be needed if phenomena of 
this kind were to be supported. As in the case of quantifier scoping, I envisage a 
module that would accept "underspecified" Top formulae, formulae that would not 



name explicitly the entities or times to which anaphoric expressions refer (figure 3.2). 
The module would determine the most probable referents of these expressions, using a 
discourse model that would contain descriptions of previously mentioned entities and 
times, information showing in which segments of the previous discourse the entities or 
times were mentioned, etc. (see [Barros & De Roeck 94] for a description of a similar 



module). The output of this module would be a formula that names explicitly the 
referents of anaphoric expressions. 



6.6.4 Equivalential translator 

The translation from Top to Tsql2 of chapter [| assumes that each pair (tt, n) of a 
Top predicate functor ir and an arity n can be mapped to a valid-time relation (stored 
directly in the database, or computed from information in the database) that shows the 
event times where 7r(ri, . . . ,r n ) is true, for all the possible world entities denoted by 
the Top terms . . . , r n . The configurer of the Nlitdb specifies this mapping when 
defining h' p j uns (section |5.9|) . Although the assumption that each (71", n) can be mapped 
to a suitable relation is valid in most situations, there are cases where this assumption 
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does not hold. The "doctor on board" problem [Rayner 93 1 is a well-known example 
of such a case. 

Let us imagine a database that contains only the following coalesced valid-time relation, 
that shows the times when a (any) doctor was on board each ship of a fleet. 



doctor _on Jboard 


ship 




Vincent 
Invincible 


[8:30am 22/1/96 - 11:45am 22/1/96] 
U [3:10pm 23/1/96 - 5:50pm 23/1/96] 
U [9:20am 24/1/96 - 2:10pm 24/1/96] 

[8:20am 22/1/96 - 10:15am 22/1/96] 
U [1:25pm 23/1/96 - 3:50pm 23/1/96] 



Let us also consider a question like ( |6.15 ), which would be mapped to the Top for- 
mula flO]). I assume here that "doctor" and "ship" introduce predicates of the form 
doctor(ri) and shipfa), and that the predicative preposition "on" introduces a pred- 
icate of the form locatedjonir^^T^) (ti,...,T4 S TERMS). For simplicity, I assume 
that "doctor" and "ship" do not introduce Ntense operators (section [4.9. 1| ). 

(6.15) Is there a doctor on some ship? 

(6.16) doctor(d v ) A ship(s v ) A Pres[located.on(d v , s v )] 

To apply the Top to Tsql2 translation method of chapter ||, one needs to map 
(doctor, 1) to a valid-time relation (computed from information in the database) that 
shows the event times where doctor (ti) is true, i.e. when the entity denoted by t\ was 
a doctor. Unfortunately, the database (which contains only doctor jon -board) does not 
show when particular entities were doctors, and hence such a relation cannot be com- 
puted. In the same manner, (ship, 1) has to be mapped to a relation that shows the 
ships that existed at each time. This relation cannot be computed: doctor _onjboard 
does not list all the ships that existed at each time; it shows only ships that had a 
doctor on board at each time. Similarly, (located-on, 2) has to be mapped to a relation 
that shows when located_on(T^,T4) is true, i.e. when the entity denoted by T3 was on 
the entity denoted by T4. Again, this relation cannot be computed. If, for example, 
T3 denotes a doctor (e.g. Dr. Adams) and T4 denotes Vincent, there is no way to find 
out when that particular doctor was on Vincent: doctor _onJboard shows only when 
some (any) doctor was on each ship; it does not show when particular doctors (e.g. 
Dr. Adams) were on each ship. Hence, the translation method of chapter || cannot be 
used. 
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It should be easy to see, however, that Q6.16 ) is equivalent to ( |6.17 ), if doctor _on~ship{T§) 



is true at event times where the entity denoted by T5 is a ship, and a doctor of that 
time is on that ship. What is interesting about ( |6.17 ) is that there is enough infor- 



mation in the database to map (doctor_on_ship, 1) to a relation that shows the event 
times where doctor -onshipfa) holds. Roughly speaking, one simply needs to map 
(doctor _on_ship, 1) to the doctor _on_board relation. Hence, the Top to Tsql2 trans- 
lation method of chapter |5] can be applied to ( |6.17 ), and the answer to ( |6.15| ) can be 
found by evaluating the resulting Tsql2 code. 

(6.17) Pres [doctor _on_ship(s v )] 



The problem is that ( |6.15| ) cannot be mapped directly to ( 6,17j ): the English to Top 



mapping of chapter [| generates ( |6.16| ). We need to convert ( |6.16| ) (whose pred- 



icates are introduced by the lexical entries of nouns, prepositions, etc.) to ( 6.17 ) 
(whose predicates are chosen to be mappable to relations computed from information 
in the database). An "equivalential translator" similar to the "abductive equivalen- 



tial translator" of [ Rayner 93| ] and [Alshawi et al. 92] could be used to carry out this 



conversion. Roughly speaking, this would be an inference module that would use 
domain-dependent conversion rules, like ( |6.18j ) which allows any formula of the form 
doctor(T\) A ship(T2) A Pres[located-on(ri, T2)] {t\,ti G TERMS) to be replaced by 
Pres\doctor -.onship(j'i)\. ( 6.18 ) would license the conversion of (|1|) into (l6~17|) . 



(6.18) doctor{j\) A ship(j2) A Pres[located_on(Ti, T2)] = Pres[doctor_onship(T2)] 

There would be two kinds of pairs (ir, n) (tt is a predicate functor and n an arity): pairs 
that are mapped to relations, and pairs for which this mapping is impossible (the value 
of h' p j uns would be undefined for the latter). The formula generated after the scoping 



and anaphora resolution would be passed to the equivalential translator (figure 3.2). 
If all the predicate functors and arities in the formula are among the pairs that are 
mapped to relations, the equivalential translator would have no effect. Otherwise, the 
equivalential translator would attempt to convert the formula into another one that 
contains only predicate functors and arities that are mapped to relations (an error 
would be reported if the conversion is impossible). The new formula would then be 
passed to the Top to Tsql2 translator. 
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6.6.5 Response generator 

The execution of the Tsql2 code produces the information that is needed to answer 
the user's question. A response generator is needed to report this information to the 
user. In the simplest case, if the question is a yes/no one, the response generator would 
simply print a "yes" or "no", depending on whether or not the Tsql2 code retrieved 
at least one tuple (section |5.10| ). Otherwise, the response generator would print the 
tuples retrieved by the Tsql2 code. 

Ideally, the response generator would also attempt to provide cooperative responses 
(section L4; see also section |8.2| below). In (|6.19| ), for example, if BA737 is at gate 



4, the response generator would produce (|6.20| ) rather than a simple "no". That is, it 
would report the answer to ( 6.19 ) along with the answer to (16.21 ). 



(6.19) Is BA737 at gate 2? 

(6.20) No, BA737 is at gate 4. 

(6.21) Which gate is BA737 at? 

In that case, the architecture of the Nlitdb would have to be more elaborate than that 
of figure 6.2, as the response generator would have to submit questions (e.g. ( 6.21] )) on 



its own, in order to collect the additional information that is needed for the cooperative 
responses. 

6.6.6 Configuration tools 

As already noted, there are several parts of the prototype Nlitdb that need to be 
modified whenever the Nlitdb is configured for a new application. Most large-scale 
Nlidbs provide tools that automate these modifications, ideally allowing people that 
are not aware of the details of the Nlitdb's code to configure the system (see section 
6 of [ Androutsopoulos et al. 95b], and chapter 11 of [ Alshawi 92| ]). A similar tool is 



needed in the prototype Nlitdb of this thesis. Figure 6.2 shows how this tool would 
fit into the Nlitdb's architecture.^ 

3 Some of the heuristics of the quantifier scoping module and parts of the anaphora resolution module 
may in practice be also domain-dependent. In that case, parts of these modules wou ld also have to 
be modified during the configuration. For simplicity, this is not shown in figure p.2[ 
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gates(gate, availability) 
runway s(runway, availability) 
queues(queue, runway) 
servicers (servicer) 
inspector s(inspector) 
sector s(sector) 
flights(f light) 
tanks(tank, content) 

norm jdepartures( flight, norm Aep -time, norm_dep_gate) 
nor m jar rivals( flight, nor m_arr dime, norm_arr_gate) 
norm_servicer (flight, servicer) 
f light Jocations( flight, location) 
circling(f light) 

inspections(code, inspector, inspected, status) 

services[code, servicer, flight, status) 

boar ding s(code, flight, gate, status) 

landings (code, flight, runway, status) 

takeoff s (code, flight, runway, status) 

taxiing s(code, flight, origin, destination, status) 

Figure 6.3: Relations of the airport database 



6.7 The airport database 



This section provides more information about the hypothetical airport database, for 
which the prototype Nlitdb was configured. 

The airport database contains nineteen relations, all valid-time and coalesced (section 
5.2. 3j ). Figure 6.3 shows the names and explicit attributes of the relations. For simplic- 
ity, I assume that the values of all the explicit attributes are strings. I also assume that 
chronons correspond to minutes, and that the gregorian calendric relation of section 



5.3.3 is available. The runways relation has the following form: 



runways 


runway 


availability 




runway 1 

runway 1 

runwayl 
runwayl 


open 

closed 

open 
closed 


[8:00am 1/1/96, 7:30pm 3/1/96] 
U [6:00am 4/1/96, 2:05pm 8/1/96] U ... 

[7:31pm 3/1/96, 5:59am 4/1/96] 
U [2:06pm 8/1/96, 5:45pm 8/1/96] 

[5:00am 1/1/96, 9:30pm 9/1/96] U ... 

[9:31pm 9/1/96, 10:59am 10/1/96] U ... 



The availability values are always open or closed. There are two tuples for each runway: 
one showing the times when the runway was open, and one showing the times when it 
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was closed. If a runway did not exist at some time (e.g. a runway may not have been 
constructed yet at that time) , both tuples of that runway exclude this time from their 
time-stamps. The gates relation is similar. Its availability values are always open or 
closed, and there are two tuples for each gate, showing the times when the gate was 
open (available) or closed (unavailable) respectively. 

Runways that are used for landings or take-offs have queues, where flights wait until 
they are given permission to enter the runway. The queues relation lists the names of 
the queues that exist at various times, along with the runways where the queues lead 
to. The servicers relation shows the names of the servicing companies that existed 
at any time. The inspectors, sectors, and flights relations are similar. The tanks 
relation shows the contents (water, foam, etc., or empty if the tank was empty) of 
each tank at every time where the tank existed. 

Each outgoing flight is assigned a normal departure time and gate (see also section 
Pp . The norm-departures relation shows these times and gates. For example, if 



normjiepartures were as follows, this would mean that from 9:00am on 1/1/92 to 
5:30pm on 31/11/95 BA737 normally departed each day from gate 2 at 2:05pm. (For 
simplicity, I assume that all flights are daily.) At 5:31pm on 31/11/95, the normal 
departure time of BA737 was changed to 2:20pm, while the normal departure gate 
remained gate 2. No further change to the normal departure time or gate of BA737 
was made since then. 



norm-departures 


flight 


norm -dep Jtime 


norm-dep-gate 




BA737 


2:05pm 


gatel 


[9:00am 1/1/92, 5:30pm 31/11/95] 


BA737 


2:20pm 


gatel 


[5:31pm 31/11/95, now] 



Similarly, each incoming flight is assigned a normal arrival time and a gate, listed in 
norm-arrivals. Flights are also assigned normal servicers, servicing companies that 
over a period of time normally service the flights whenever they arrive or depart. 
This information is stored in normservicer . The flight-locations relation shows the 
location of each flight over the time. Possible location values are the names of airspace 
sectors, gates, runways, or queues of runways. The circling relation shows the flights 
that were circling at each time. 



As discussed in section |2.4.5| , flights, gates, and runways are occasionally inspected. 
The inspections relation was discussed in section |5.9| . It shows the inspection code, 
inspector, inspected object, status (completed or not), and time of each inspection. 



CHAPTER 6. THE PROTOTYPE NLITDB 265 

The services, boardings, landings, takeoffs, and taxiings relations are very similar. 
They provide information about actual services, boardings, landings, take-offs, and 
taxiings from one location (origin) to another (destination). Each service, boarding, 
landing, take-off, or taxiing is assigned a unique code, stored in the code attribute. The 
status attribute shows if the climax is reached at the latest time-point of the time- 
stamp. The values of the origin and destination attributes of taxiings are names of 
gates, runways, and queues. 

Apart from relations, a database would in practice also contain integrity constraints 



(see |Gertz &; Lipeck 9q| and [|Wijsen 95 1 ). There would be, for example, a constraint 
saying that if the circling relation shows a flight as circling at some time, the flights 
relation must show that flight as existing at the same time. I do not discuss integrity 
constraints, as they are not directly relevant to this thesis. 



6.8 Sample questions and output 



This section presents sample questions from the airport domain, along with the corre- 
sponding output of the prototype Nlitdb. The questions are chosen to demonstrate 
that the Nlitdb behaves according to the specifications of the previous chapters. The 
questions are not intended to be (and are probably not) a representative sample of 
questions that a real user might want to submit in the airport domain (see comments 
about Wizard of Oz experiments in section below). The user submits questions 
using the nli Prolog predicate: 

I ?- nli ( [which, flight, left, sector3, at, pm5_00, yesterday] ) . 



The system parses the question and reports the generated Hpsg sign. 

HPSG Sign: 
(phrase , 

qstore: (ne_set_quant , 

elt : (det : exists , 

restind: (index: (_10148, 

minute_ent , 
tvar :plus) , 
restr: (ne_set_psoa, 
elt : (part , 

part_var:_10148, 
partng:pm5_00) , 
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elts : e_set) ) ) , 

elts: (ne_set_quant , 

elt : (det : interrog , 

restind: (index: (_10486, 

f light_ent , 
tvar :plus) , 
restr: (ne_set_psoa, 
elt: (flight, 

argl:_10486) , 
elts : e_set) ) ) , 

elts : e_set) ) , 
synsem: (synsem, 

loc : (cat : (aspect : point , 
comps : e_list , 
head : (aux : minus , 
inv: minus , 
mod: none, 
prd: minus , 
vf orm: fin) , 
spr : e_list , 
subj : e_list) , 
cont : (at_op, 

main_psoa: (at_op, 

main_psoa: (past, 

et_handle : (temp_ent , 

tvar : plus) , 
main_psoa: (leave_something, 
argl:_10486, 
arg2 : sector3) ) , 

time_spec : _10148) , 
time_spec : yesterday) ) , 
nonloc: (inherited: slash :e_set, 
to_bind : slash : e_set) ) ) 



The sign above is written in Ale's notation. The sign is of sort phrase (it corre- 
sponds to a phrase rather than a single word), and it has the features QSTORE and 
synsem. The QSTORE value represents a non-empty set of quantifiers (neset_quant). 
Its elt feature describes the first element of that set, which is an existential quanti- 
fier. The quantifier ranges over a Top variable, represented by an Hpsg index of sort 



minute-ent (see figure 4.4 on page |126j ) whose tvar is + (the index represents a Top 
variable rather than a constant). The elt value represents the Top-like expression 
3x2 v Part [pm5J)0 9 , x2 v ]. The Prolog variable _10148 is a pointer to the index of the 
quantifier, i.e. it plays the same role as the boxed numbers (e.g. CD, [2]) in the Hpsg 
formalism of chapter |3]. The elts value describes the rest of the set of quantifiers, 
using in turn an elt feature (second element of the overall set), and an elts feature 
(remainder of the set, in this case the empty set). The second element of the overall set 
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represents the Top expression lxl v flight(xl v ). In the airport application, the lexi- 
cal entries of non-predicative nouns do not introduce Ntense operators (this generates 
appropriate readings in most cases; see the discussion in section |4.9.1| ). This is why no 
Ntense operator is present in the second quantifier of the sign. (The effect of Ntenses 
can still be seen in the airport application in the case of non-predicative adjectives, 
that do introduce Ntenses.) 

The features of the synsem value are as in chapter ||. The CONT value represents the 
Top expression At[yesterday, At[x2 v , Past [x3 v , leavesomething(xl v , sectorS)]]]. The 



extractor of section |6.4| extracts a Top formula from the sign, and prints it as a Prolog 
term. 

TOP formula extracted from HPSG sign: 

interrog(xl~v, 

and (part (pm5_00~g, x2~v) , 
and(f light (xl~v) , 
at (yesterday, 
at (x2~v, 

past (x3~v, 

leave_something(xl~v, sector3) )))))) 

The Prolog term above stands for: 

IxV Part[pm5.00 9 ,x2 v ] A flight(xl v ) A 

At[y ester day, At[x2 v , Past[x3 v , leavesomething(xl v , sector3)]]] 



The extracted formula then goes through the post-processor of section 3.4. The post- 
processor eliminates the Part operator, adding the pm5-O0 9 as an extra argument to 
the corresponding At operator: 

TOP formula after post-processing: 

interrog(xl~v, 

and(f light (xl~v) , 
at (yesterday, 

at(pm5_00~g, x2~v, 
past (x3~v, 

leave_something(xl~v, sector3) ) ) ) ) ) 

The Prolog term above stands for: 

(6.22) IxV flight(xl v ) A At[yesterday, 

At\pm5J)0 9 , x2 v , Past[x3 v , leavesomething(xl v , sector3)]]] 
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The post-processed formula is then translated into Tsql2: 



TSQL2 query: 

(SELECT DISTINCT SNAPSHOT t8 . 1 

FROM (SELECT DISTINCT t6.1, t7.1, t7.2, t7.3, t7.4 
VALID VALID (t6) 
FROM (SELECT DISTINCT tl.l 
VALID VALID(tl) 
FROM (SELECT DISTINCT fl.l 
VALID VALID (fl) 
FROM flights (PERIOD) AS fl 
) (SUBPERIOD) AS tl 
WHERE PERIOD (TIMESTAMP 'beginning', TIMESTAMP 'forever') 
CONTAINS VALID (tl) 
) AS t6, 

(SELECT DISTINCT t2.1, t5.1, t5.2, t5.3 
VALID VALID (t5) 

FROM (SELECT DISTINCT SNAPSHOT VALID (cp2) 
FROM gregorian AS cp2 
WHERE cp2.5 = '17' AND cp2 . 6 = '00' 
) AS t2, 

(SELECT DISTINCT VALID (t4), t4.1, t4.2 
VALID VALID (t4) 

FROM (SELECT DISTINCT t3 . 1 , t3 . 2 
VALID VALID (t3) 

FROM (SELECT DISTINCT floes. 1, floes. 2 
VALID PERIOD (END (VALID (floes) ) , 
END (VALID (floes))) 
FROM flight_locations (PERIOD) AS floes 
) (SUBPERIOD) AS t3 
WHERE t3.2 = 'sector3' 
AND INTERSECT ( 

INTERSECT(t2.1, 
INTERSECT ( 

PERIOD (TIMESTAMP 'beginning', 

TIMESTAMP 'forever'), 
PERIOD 'today' - INTERVAL '1' DAY)), 
PERIOD (TIMESTAMP 'beginning', 
TIMESTAMP 'now' - 

INTERVAL '1' MINUTE)) 
CONTAINS VALID (t3) 

) AS t4 
) AS t5 
) AS t7 
WHERE t6.1 = t7.3 

AND VALID (t6) = VALID (t7) 
) AS t8 

) 



The "SELECT DISTINCT fl.l ... FROM flights (PERIOD) AS fl" that starts at the sixth 
line of the Tsql2 code is the SELECT statement to which h' p j uns maps predicates of the 
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form flight(ri). This statement returns a relation that shows the flights that existed 
at each time. The embedded SELECT statement that is associated with the correlation 
name t6 is the result of applying the translation rule for predicates (section |5.11 ) 



to the flight(xl v ) of fl6.22| ). The "WHERE PERIOD (TIMESTAMP 'beginning', TIMESTAMP 
'forever' ) CONTAINS VALID (tl)" corresponds to the restriction that et must fall within 
It. (At this point, no constraint has been imposed on It, and hence It covers the whole 
time-axis.) This WHERE clause has no effect and could be removed during an optimisation 
phase (section |5.12|) . 

The "SELECT DISTINCT flocs.l... flight_locations (PERIOD) AS floes" that starts at 
the 23rd line of the TSQL2 code is the SELECT statement to which h' p j uns maps predicates 
of the form leavesomething(ri, r-i) . This statement generates a relation that for each 
flight and location, shows the end-points of maximal periods where the flight was at 
that location. The embedded SELECT statement that is associated with t4 is the result 
of applying the translation rule for predicates to the leavesomething(xl v , sector?)) of 



( |6.22j ). VALID (t3) is the leaving-time, which has to fall within It. The three nested 
INTERSECTS represent constraints that have been imposed on It: the Past operator re- 
quires It to be a subperiod of [p/i rs t,st) (i.e. a subperiod of TIMESTAMP 'beginning' , 
TIMESTAMP 'now' - INTERVAL '1' MINUTE), the At\pm5J00 9 , ...] requires It to be a sub- 
period of a 5:00pm-period (t2 . 1 ranges over 5:00pm-periods), and the At[yesterday, . . .] 
requires the localisation time to be a subperiod of the previous day (PERIOD 'today' - 
INTERVAL '1' DAY). 

The SELECT statement that is associated with t5 is generated by the translation rule 
for Past (section |5,11 ), and the SELECT statement that is associated with t7 is intro- 



duced by the translation rule for At[a g , (3, (j)'] (section |A.3.15| ) . (The At[y ester day, . . .) 
of (|6.22j) does not introduce its own SELECT statement, it only restricts It; see the 



translation rule for At[K,(f)'] in section 5.11|.) The SELECT statement that is associated 



with t8 is introduced by the translation rule for conjunction (section [A.3.3| ). It re- 
quires the attribute values that correspond to the xl v arguments of flight(xl v ) and 
leavesomething(xl v , sector?), and the event times where the two predicates are true 
to be identical. Finally, the top-level SELECT statement is introduced by the translation 



rule for ?/3i ?/?2 ?/% • • • <p' (section A. 4.1). It returns a snapshot relation that 



contains the attribute values that correspond to xV (the flights). 
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No further comments need to be made about the generated Hpsg signs and Tsql2 
queries. To save space, I do not show these in the rest of the examples. I also do 
not show the Top formulae before the post-processing, unless some point needs to be 
made about them. 



As noted in section 2.5.3, no attempt is made to block progressive forms of state verbs. 
The progressive forms of these verbs are taken to have the same meanings as the 
corresponding non-progressive ones. This causes the two questions below to receive 
the same Top formula. 

I ?- nli( [which, tanks, contain, water] ) . 

TOP formula after post-processing: 

interrog(xl~v, 

and (tank (xl~v) , 

pres(contains(xl~v, water)))) 

I ?- nli( [which, tanks, are, containing, water] ) . 

TOP formula after post-processing: 

[same formula as above] 

There are two lexical entries for the base form of "to service", one for the habitual 
homonym, and one for the non-habitual one. The habitual entry introduces the pred- 
icate functor habservicer_of and classifies the base form as state. The non-habitual 
entry introduces the functor actl servicing and classifies the base form as culminat- 



ing activity. The simple present lexical rule (section 4.7.1) generates a simple present 
lexical entry for only the habitual homonym (whose base form is state). Hence, the 
"services" below is treated as the simple present of the habitual homonym (not as the 
simple present of the non-habitual homonym), and only a formula that contains the 
habservicer-of functor is generated. This captures the fact that the question can 
only have a habitual meaning (it cannot refer to a servicer that is actually servicing 
BA737 at the present; the reader is reminded that the scheduled-to-happen reading of 



the simple present is ignored in this thesis - see section 2.5.1 ). 



I ?- nli( [which, servicer, services, ba737] ) . 
TOP formula after post-processing: 
interrog(xl~v, 
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and(servicer (xl~v) , 

pres (hab_servicer_of (xl~v, ba737) ) ) ) 



In contrast, the present participle lexical rule (section [4.7,1|) generates progressive 
entries for both the non-habitual (culminating activity base form) and the habitual 
(state base form) homonyms. This causes the question below to receive two parses, 
one where the "is servicing" is the present continuous of the non-habitual homonym, 
and one where it is the present continuous of the habitual homonym. This gives rise to 
two formulae, one involving the actl servicing functor (the servicer must be servicing 
BA737 at the present), and one involving the habservicer-of functor (the servicer 
must be the current normal servicer of BA737). (The x2 v in the first formula is an 



occurrence identifier; see section 3.16.) The habitual reading of the second formula 
seems rather unlikely in this case. 

I ?- nli( [which, servicer, is, servicing, ba737] ) . 

TOP formula after post-processing: 

interrog(xl~v, 

and(servicer (xl~v) , 

pres(actl_servicing(x2~v, xl~v, ba737)))) 

TOP formula after post-processing: 

interrog(xl~v, 

and(servicer (xl~v) , 

pres (hab_servicer_of (xl~v, ba737) ) ) ) 

There are also different lexical entries for the actual "to depart" and the habitual "to 
depart" (at some time). The habitual entry introduces the functor hab_depJ.ime, re- 
quires an "at ..." complement, and classifies the base form as state. The non-habitual 
entry introduces the functor actl_depart, requires no complement, and classifies the 
base form as point. When "BA737 departed at 5:00pm." is taken to involve the ha- 
bitual homonym, "at 5:00pm" is treated as the complement that specifies the habitual 
departure time (the second argument of hab-dep-time(ri , T2)) ■ When the sentence is 
taken to involve the non-habitual homonym, "at 5:00pm" is treated as a temporal 



modifier, and it introduces an At operator (section 4.11.1 ). In the following question, 
this analysis leads to two formulae: one where each reported flight must have actually 
departed at 5:00pm at least once in 1993, and one where the habitual departure time 
of each reported flight must have been 5:00pm some time in 1993. The second reading 



CHAPTER 6. THE PROTOTYPE NLITDB 
seems the preferred one in this example. 



272 



I ?- nli( [which, flights, departed, at, pm5_00, in, y 1993] ) . 

TOP formula after post-processing: 

interrog(xl~v, 

and(f light (xl~v) , 
at(yl993, 

at (pm5_00~g, x2~v, 
past(x3~v, 

actl_depart (xl~v) ) ) ) ) ) 

TOP formula after post-processing: 

interrog(xl~v, 

and (flight (xl~v) , 
at(yl993, 

past(x2~v, 

hab_dep_time (xl~v, pm5_00))))) 

The first question below receives no parse, because "to circle" is classified as activity 
verb (there is no habitual state homonym in this case), and the simple present lexical 
rule does not generate simple present lexical entries for activity verbs. In contrast, the 
present participle lexical rule does generate progressive entries for activity verbs. This 
causes the second question below to be mapped to the formula one would expect. The 
failure to parse the first question is justified, in the sense that the question seems to 
be asking about flights that have some circling habit, and the Nlitdb has no access 
to information on circling habits. A more cooperative response, however, is needed to 
explain this to the user. 

I ?- nli( [does, ba737, circle] ) . 

**No (more) parses. 
I ?- nli( [is, ba737, circling] ) . 
TOP formula after post-processing: 
pres (circling(ba737) ) 



Following the arrangements of section [4.11.2 , in the following question where a cul 



minating activity combines with a period adverbial, two formulae are generated: one 
where the inspection must have simply been completed on 1/5/92, and one where the 
whole inspection (from start to completion) must have been carried out on 1/5/92. 
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2.9.2 



I ?- nli( [who, inspected, ukl60, on, dl_5_92] ) . 

TOP formula after post-processing: 

interrog(xl~v, 

at(dl_5_92, 

end (past (x2~v, 

culm(inspecting(x3~v, xl~v, ukl60)))))) 

TOP formula after post-processing: 

interrog(xl~v, 

at(dl_5_92, 
past (x2~v, 

culm(inspecting(x3~v, xl~v, ukl60))))) 



In the following question, the punctual adverbial "at 5:00pm" combines with a culmi- 
nating activity. According to section |2.9.1 , two readings arise: one where the taxiing 



starts at 5:00pm, and one where it finishes at 5:00pm. In both cases, the punctual 
adverbial causes the aspect of "which Eight taxied to gate 2 at 5:00pm" to become 
point. That point sentence then combines with the period adverbial "yesterday" . Ac- 
cording to section [2.9.2 , the instantaneous situation of the point phrase (the start or 



end of the taxiing) must occur within the period of the adverbial. This analysis leads 
to two formulae: one where the taxiing starts at 5:00pm on the previous day, and one 
where the taxiing finishes at 5:00pm on the previous day. These formulae capture the 
most likely readings of the question. Unfortunately, if the order of "at 5:00pm" and 
"yesterday" is reversed, the generated formulae are not equivalent to the ones below 
(see the discussion in section |4.16|) 

I ?- nli( [which, flight, taxied, to, gate2, at, pm5_00, yesterday] ) . 

TOP formula after post-processing: 

interrog(xl~v, 

and(f light (xl~v) , 
at (yesterday, 

at(pm5_00~g, x2"v, 
end (past (x3~v, 

culm(taxiing_to (x4~v, xl~v, gate2) )))))) ) 
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TOP formula after post-processing: 



interrog(xl~v, 

and(f light (xl~v) , 
at (yesterday, 

at(pm5_00~g, x2~v, 
begin(past (x3~v, 

culm(taxiing_to(x4~v, xl~v, gate2) )))))) ) 



In the sentence below (which is treated as a yes/no question), the treatment of past 
perfects and punctual adverbials of section [4.11. 1| allows "at 5:00pm" to modify either 
the verb phrase "left gate 2", or the entire "BA737 had left gate 2". This gives rise to 
two Top formulae: one where 5:00pm is the time at which BA737 left gate 2, and one 
where 5:00pm is a reference time at which BA737 had already left gate 2. The two 
formulae capture the two most likely readings of the sentence. 

I ?- nli( [ba737,had,left,gate2,at,pm5_00] ) . 

TOP formula after post-processing: 

past(x2~v, 

perf (x3~v, 

at(pm5_00~g, xl~v, 

leave_something(ba737 , gate2)))) 

TOP formula after post-processing: 

at(pm5_00~g, xl~v, 
past (x2~v, 

perf (x3~v, 

leave_something(ba737 , gate2)))) 



Similarly, in the following question, the "at 5:00pm" is allowed to modify either the 
verb phrase "taken off", or the entire "BA737 had taken off". In the first case, the 
verb phrase still has the aspectual class of the base form, i.e. culminating activity. 



According to section 2.9.1, 5:00pm is the time where the taking off was completed 
or started. These two readings are captured by the first and second formulae below. 
(The second reading seems unlikely in this example.) In the case where "at 5:00pm" 
modifies the entire "BA737 had taken off", the "had" has already caused the aspect 
of "BA737 had taken off" to become (consequent) state. According to section [2.9.1 



in that case 5:00pm is simply a time-point where the situation of the sentence (having 
departed) holds. This reading is captured by the third formula. 

I ?- nli( [ba737, had, taken, off, at, pm5_00] ) . 
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TOP formula after post-processing: 

past(x2~v, 

perf (x3~v, 

at(pm5_00~g, xl~v, 

end(culm(taking_of f (x4"v, ba737) ) ) ) ) ) 

TOP formula after post-processing: 

past(x2~v, 

perf (x3~v, 

at(pm5_00~g, xl~v, 

begin(culm(taking_of f (x4~v, ba737) ) ) ) ) ) 

TOP formula after post-processing: 

at(pm5_00~g, xl~v, 
past (x2~v, 

perf (x3~v, 

culm(taking_off (x4~v, ba737))))) 



The first question below receives the formula one would expect. As discussed in sec- 



tion 4.16, in the second question below the grammar of chapter || allows two parses: 
one where "yesterday" attaches to "BA737 was circling", and one where "yesterday" 
attaches to "BA737 was circling for two hours". These two parses give rise to two 
different but logically equivalent formulae. 

I ?- nli( [ba737, was, circling, for, two, hours, yesterday] ) . 

TOP formula after post-processing: 

at (yesterday, 

for(hour~c, 2, 
past(xl~v, 

circling(ba737)))) 

I ?- nli( [yesterday, ba737, was, circling, for, two, hours] ) . 

TOP formula after post-processing: 

for(hour~c, 2, 

at (yesterday, 
past(xl~v, 

circling(ba737)))) 

TOP formula after post-processing: 

at (yesterday, 

for(hour~c, 2, 
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past (xl~v, 

circling (ba737)))) 



The following example reveals a problem in the current treatment of temporal mod- 
ifiers. The Hpsg version of this thesis (section |4.1l| ) allows temporal modifiers to 
attach only to finite sentences (finite verb forms that have already combined with their 
subjects and complements) or past participle verb phrases (past participles that have 
combined with all their complements but not their subjects). In both cases, the tem- 
poral modifier attaches after the verb has combined with all its complements. English 
temporal modifiers typically appear either at the beginning or the end of the sentence 
(not between the verb and its complements), and hence requiring temporal modifiers 
to attach after the verb has combined with its complements is in most cases not a 
problem. However, in the following question (which most native English speakers find 
acceptable) the temporal modifier ( "for two hours") is between the verb ( "queued") 
and its complement ("for runway2"). Therefore, the temporal modifier cannot attach 
to the verb after the verb has combined with its complement, and the system fails to 
parse the sentence. (In contrast, "UK160 queued for runway 2 for two hours.", where 
the temporal modifier follows the complement, is parsed without problems.) 

I ?- nli( [ukl60, queued, for, two, hours, for, runway2] ) . 
**No (more) parses. 



As explained in section 4.17 , the post-processing removes Culm operators that are 
within For operators introduced by "for ... " adverbials. This is demonstrated in the 
following example. The "for ... " adverbial introduces a f or_remove_culm pseudo- 
operator, which can be thought of as a For operator with a flag attached to it, that 
signals that Culms within the For operator must be removed. The post-processor 
removes the Culm, and replaces the f or_remove_culm with an ordinary For operator. 

I ?- nli( [which, flight, boarded, for, two, hours] ) . 

TOP formula extracted from HPSG sign: 

interrog(xl~v, 

and(f light (xl~v) , 

f or_remove_culm(hour~c , 2, 
past (x2~v, 

culm (boarding (x3~v, xl~v)))))) 
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TOP formula after post-processing: 

interrog(xl~v, 

and (flight (xl~v) , 
for(hour~c, 2, 
past(x2~v, 

boarding(x3~v, xl~v))))) 



Duration "in ... " adverbials introduce For operators that carry no flag to remove 
enclosed Culms. In the following question, this leads to a formula that (correctly) 
requires the boarding to have been completed. 

I ?- nli( [which, flight, boarded, in, two, hours] ) . 

TOP formula after post-processing: 

interrog(xl~v, 

and(f light (xl'v) , 
for(hour~c, 2, 
past (x2~v, 

culm (boarding (x3~v, xl~v)))))) 



As explained in section 2.5.4, the present perfect is treated in exactly the same way as 



the simple past. This causes the two questions below to receive the same formula. 

I ?- nli( [which, flight, has, been, at, gate2, for, two, hours] ) . 

TOP formula after post-processing: 

interrog(xl~v, 

and(f light (xl'v) , 
for(hour~c, 2, 
past (x2~v, 

located_at (xl~v, gate2))))) 

I ?- nli( [which, flight, was, at, gate2, for, two, hours] ) . 

TOP formula after post-processing: 

[same formula as above] 



As discussed in section 2J3, when "finished" combines with a culminating activity, the 
situation must have reached its completion. In contrast, when "stopped" combines with 
a culminating activity, the situation must have simply stopped, without necessarily 
reaching its completion. This difference is captured in the two formulae below by the 
existence or absence of a Culm. 
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I ?- nli( [j_adams, finished, inspecting, ukl60, at, pm5_00] ) . 

TOP formula after post-processing: 

at(pm5_00~g, xl~v, 
past(x2~v, 

end ( culm ( inspecting (x3~v, j_adams, ukl60))))) 

I ?- nli( [j_adams, stopped, inspecting, ukl60 , at,pm5_00] ) . 

TOP formula after post -processing: 

at(pm5_00~g, xl~v, 
past(x2~v, 

end(inspecting(x3~v, j_adams, ukl60)))) 

In the airport domain, non-predicative adjectives (like "closed" below) introduce Ntense 
operators. In the question below, the formula that is extracted from the Hpsg sign 



contains an Ntense whose first argument is a variable. As explained in section 4.17, 
this leads to two different formulae after the post-processing, one where "closed" refers 
to the present, and one where "closed" refers to the time of the verb tense. 

I ?- nli( [was, any, flight, on, a, closed, runway, yesterday] ) . 

TOP formula extracted from HPSG sign: 

and(f light (xl'v) , 

and (and (ntense (x2~v, closed(x3~v) ) , 
runway (x3~v) ) , 
at (yesterday, 
past (x4~v, 

located_at (xl~v, x3~v))))) 

**Post processing of TOP formula generated 2 different formulae. 

TOP formula after post-processing: 

and(f light (xl~v) , 

and (and (ntense (now, closed(x3~v) ) , 
runway (x3~v) ) , 
at (yesterday, 
past (x4~v, 

located_at(xl~v, x3~v))))) 

TOP formula after post-processing: 

and(f light (xl~v) , 

and (and (ntense (x4~v, closed(x3~v) ) , 
runway (x3~v) ) , 
at (yesterday, 
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past (x4~v, 

located_at(xl~v, x3~v))))) 



In the following question, the "currently" clarifies that "closed" refers to the present. 
The Ntense in the formula extracted from the Hpsg sign has now* as its first argument. 
The post-processing has no effect. 

I ?- nli( [was, any, flight, on, a, currently, closed, runway, yesterday] ) . 

TOP formula extracted from HPSG sign: 

and(f light (xl~v) , 

and (and (ntense (now, closed(x2~v) ) , 
runway (x2~v) ) , 
at (yesterday, 
past(x3~v, 

located_at(xl~v, x2~v))))) 



In the following question, the verb tense refers to the present, and hence "closed" can 
only refer to a currently closed runway. The post-processor generates only one formula, 
where the first argument of Ntense is now*. 

I ?- nli ( [is , any , flight , on, a, closed, runway] ) . 

TOP formula extracted from HPSG sign: 

and(f light (xl~v) , 

and (and (ntense (x2~v, closed(x3~v) ) , 
runway (x3~v) ) , 
pres (located_at (xl~v, x3~v)))) 

TOP formula after post-processing: 

and(f light (xl~v) , 

and (and (ntense (now, closed(x3~v) ) , 
runway (x3~v) ) , 
pres (located_at (xl~v, x3~v)))) 

Predicative adjectives do not introduce Ntenses (section |4.10|) , and Top predicates 
introduced by these adjectives always end up within the operator(s) of the verb tense. 
This captures the fact that predicative adjectives always refer to the time of the verb 
tense. 



I ?- nli( [was,gate2,open,on,monday] ) 
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TOP formula after post-processing: 

at(monday~g, xl~v 
past(x2~v, 

open(gate2) ) ) 



For reasons explained in section 4.9.2| , the system fails to parse sentences that contain 



proper names or names of days, months, etc. when these are used as predicative noun 
phrases (e.g. the first two questions below). Other predicative noun phrases pose no 
problem (e.g. the third question below). 

I ?- nli( [dl_l_91,was,a,monday] ) . 

**No (more) parses. 
I ?- nli( [ba737,is,ukl60] ) . 

**No (more) parses. 
I ?- nli( [ba737,is,a,f light] ) . 
TOP formula after post -processing: 
pres (flight (ba737)) 



Multiple interrogative words can be handled, as demonstrated below. 

I ?- nli( [which, flight, is, at, which, gate] ) . 

TOP formula after post-processing: 

interrog(xl~v, 

interrog(x2~v, 

and (gate (xl~v) , 

and(f light (x2~v) , 

pres (located_at (x2~v, xl~v)))))) 



In the first question below, the grammar of chapter [| allows "yesterday" to attach to 
either "BA737 was circling" or to the whole "did any Eight leave a gate while BA737 
was circling". Two Hpsg signs are generated as a result of this, from which two 
different but logically equivalent formulae are extracted. In contrast, in the second 
question below, the "yesterday" cannot attach to "BA737 was circling", because of 
the intervening "while" ( "while BA737 was circling" is treated as an adverbial, and 
"yesterday" cannot attach to another adverbial). Consequently, only one formula is 
generated. 
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I ?- nli( [did, any, flight, leave, a, gate, while, ba737, was, circling, yesterday] ) . 

TOP formula after post-processing: 

and(f light (xl~v) , 
and (gate (x2~v) , 

at (at (yesterday, 
past(x3~v, 

circling(ba737) ) ) , 
past (x4~v, 

leave_something(xl~v, x2~v))))) 

TOP formula after post-processing: 

and(f light (xl~v) , 
and (gate (x2~v) , 
at (yesterday, 
at (past (x3~v, 

circling(ba737) ) , 
past(x4~v, 

leave_something(xl~v, x2~v)))))) 

I ?- nli( [did, any, flight, leave, a, gate, yesterday, while, ba737, was, circling] ) . 

TOP formula after post-processing: 

and(f light (xl'v) , 
and (gate (x2~v) , 
at (past (x3~v, 

circling(ba737) ) , 
at (yesterday, 
past(x5~v, 

leave_something(xl~v, x2~v)))))) 



In the questions below, the subordinate clause is a (progressive) state. According to 



section |2.10.2| , in the first question the flights must have arrived before a time-point 
where BA737 started to board ("to arrive" is a point verb in the airport domain). In 
the second question, section |2.10.2| allows two readings: the flights must have arrived 
after a time-point where BA737 started or stopped boarding. The generated formulae 
capture these readings. 

I ?- nli( [which, flights, arrived, before, ba737, was, boarding] ) . 

TOP formula after post-processing: 

interrog(xl~v, 

and (flight (xl"v) , 

before (past (x2~v , 

boarding (x3~v, ba737)), 
past (x4~v, 
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actl_arrive (xl~v) ) ) ) ) 

I ?- nli( [which, flights, arrived, after, ba737, was, boarding] ) . 

TOP formula after post-processing: 

interrog(xl~v, 

and(f light (xl~v) , 

after (begin(past (x2~v, 

boarding(x3~v, ba737))), 

past(x4~v, 

actl_arrive (xl~v) ) ) ) ) 

TOP formula after post-processing: 

interrog(xl~v, 

and(f light (xl~v) , 

after (past (x2~v, 

boarding (x3~v, ba737)), 
past (x4~v, 

actl_arrive (xl~v) ) ) ) ) 



Below, the subordinate clause is a culminating activity. In the first question, according 
to section 2.10.2; the flights must have arrived before a time-point where BA737 finished 



or started to board. In the second question, the flights must have arrived after a time- 
point where BA737 finished boarding. These readings are captured by the generated 
formulae. 

I ?- nli( [which, flights, arrived, before, ba737, boarded] ) . 

TOP formula after post-processing: 

interrog(xl~v, 

and(f light (xl~v) , 

before (end (past (x2~v, 

culm (boarding (x3~v, ba737)))), 
past (x4~v, 

actl_arrive (xl~v) ) ) ) ) 

TOP formula after post-processing: 

interrog(xl~v, 

and(f light (xl~v) , 

before (past (x2~v , 

culm (boarding (x3~v, ba737))), 
past (x4~v, 

actl_arrive (xl~v) ) ) ) ) 

I ?- nli( [which, flights, arrived, after ,ba737, boarded] ) . 
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TOP formula after post-processing: 

interrog(xl~v, 

and (flight (xl~v) , 

after (past (x2~v, 

culm (boarding (x3~ v , ba737) ) ) , 
past(x4~v, 

actl_arrive (xl~v) ) ) ) ) 



In the next two questions, the subordinate clause is a consequent state. According to 



section 2.10.2, in the first question the flights must have arrived before the situation of 
the subordinate clause (having boarded) began, i.e. before BA737 finished boarding. In 
the second question, the flights must have arrived after the situation of the subordinate 
clause (having boarded) began, i.e. after BA737 finished boarding. These readings are 
captured by the generated Top formulae. 

I ?- nli( [which, flights, arrived, before, ba737, had, boarded] ) . 



TOP formula after post-processing: 

interrog(xl~v, 

and (flight (xl~v) , 

before (past (x2~v , 

perf (x3~v, 

culm (boarding (x4~ v , 

ba737)))) , 

past (x5~v, 

actl_arrive (xl~v) ) ) ) ) ) 



I ?- nli( [which, flights, arrived, after ,ba737, had, boarded] ) . 

TOP formula after post-processing: 

interrog(xl"v, 

and(f light (xl~v) , 

after (begin(past (x2~v, 

perf (x3~v, 

culm (boarding (x4~ v , 

ba737))))) , 

past(x5~v, 

actl_arrive (xl~v) ) ) ) ) ) 

The question below combines a "when" interrogative and a "while ... " clause. The 
generated formula asks for maximal past circling-periods of BA737 that fall within 
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I ?- nli( [when, while, ukl60, was, at, gate2, was, ba737, circling] ) . 

TOP formula after post -processing: 

interrog_mxl (x3~v , 

at(past(x2~v, 

located_at (ukl60 , gate2)), 
past(x3~v, 

circling(ba737)))) 

Finally, the question below receives two formulae: the first one asks for times of past 
actual departures; the second one asks for past normal departure times. (The latter 
reading is easier to accept if an adverbial like "in 1992" is attached.) In the second 
question, only a formula for the habitual reading is generated, because the simple 
present lexical rule (section |4.7.1] ) does not generate a simple present lexical entry for 
the non-habitual "to depart" (which is a point verb). 

I ?- nli( [when, did, ba737, depart] ) . 

TOP formula after post-processing: 

interrog_mxl (x2~v , 

past(x2~v, 

actl_depart (ba737) ) ) 

TOP formula after post-processing: 

interrog(xl~v, 

past (x2~v, 

hab_dep_time (ba737 , xl~v))) 

I ?- nli( [when, does, ba737, depart] ) . 

TOP formula after post-processing: 

interrog(xl~v, 

pres(hab_dep_time(ba737, xl~v))) 

6.9 Speed issues 

As already noted, the prototype Nlitdb was developed simply to demonstrate that the 
mappings from English to Top and from Top to Tsql2 are implement able. Execution 
speed was not a priority, and the Nlitdb code is by no means optimised for fast 
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execution. On a lightly loaded Sun SPARCstation 5, single-clause questions with single 
parses are typically mapped to TSQL2 queries in about 15-30 seconds. Longer questions 
with subordinate clauses and multiple parses usually take 1-2 minutes to process. 
(These times include the printing of all the Hpsg signs, Top formulae, and Tsql2 
queries.) The system's speed seems acceptable for a research prototype, but it is 
unsatisfactory for real-life applications. 

Whenever a modification is made in the software, the code of the affected modules has 
to be recompiled. This takes only a few seconds in the case of modules that are written 
in Prolog (the post-processor and the Top to Tsql2 translator), but it is very time- 
consuming in the case of modules that are written in Ale's formalism (the components 
of the Hpsg grammar and the extractor of Top formulae). This becomes particularly 
annoying when experimenting with the grammar, as in many cases after modifying the 
grammar all its components (sort hierarchy, lexical rules, etc.) have to be recompiled, 
and this recompilation takes approximately 8 minutes on the above machine. 

6.10 Summary 

The framework of this thesis was tested by developing a prototype Nlitdb, imple- 
mented using Prolog and Ale. The prototype was configured for the hypothetical 
airport application. A number of sample questions were used to demonstrate that the 
system behaves according to the specifications of the previous chapters. The architec- 
ture of the prototype is currently minimal. A preprocessor, mechanisms for quantifier- 
scoping and anaphora resolution, an equivalential translator, a response generator, and 
configuration tools would have to be added if the system were to be used in real-life 
applications. Execution speed would also have to be improved. 



Chapter 7 



Comparison with Previous Work 
on NLITDBs 



11 Other times other manners. 



This chapter begins with a discussion of previous work on Nlitdbs. The discussion 
identifies six problems from which previous proposals on Nlitdbs suffer. I then exam- 
ine if the framework of this thesis overcomes these problems. 



7.1 Previous work on NLITDBs 



This section discusses previous work on Nlitdbs. Clifford's work, which is the most 
significant and directly relevant to this thesis, is presented first. 



7.1.1 Clifford 



Clifford [Clifford 9C] defined a temporal version of the relational model. He also showed 
how a fragment of English questions involving time can be mapped systematically 
to logical expressions whose semantics are defined in terms of a database structured 
according to his model.[] Clifford's approach is notable in that both the semantics 
of the English fragment and of the temporal database are defined within a common 



model-theoretic framework, based on Montague semantics [Dowty et al. 81] 



1 Parts of ^Clifford 90| can be found i n flClifford 87^ ], [jClifford 87h | l , and [|Clifford 88^ , The database 
model of this section is that of Clifford 9C |. A previous version of this model appears in 
Clifford & Warren 
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Clifford extended the syntactic coverage of Montague's Ptq grammar, to allow past, 
present, and future verb forms, some temporal connectives and adverbials (e.g. "while", 
"during", "in 1978", "yesterday"), and questions. fl73]) - (O) are a11 witnin Clifford's 
syntactic coverage. (Assertions like (|7.3|) are treated as yes/no questions.) 



(7.1) Is it the case that Peter earned 25K in 1978? 

(7.2) Does Rachel manage an employee such that he earned 30K? 

(7.3) John worked before Mary worked. 

(7.4) Who manages which employees? 

(7.5) When did Liz manage Peter? 

Clifford does not allow progressive verb forms. He also claims that no distinction 
between progressive and non-progressive forms is necessary in the context of Nlitdbs 
(see p. 12 of [Clifford 87a| ). According to Clifford's view, ([7l]) can be treated in exactly 



the same manner as ( \l-7\ ). This ignores the fact that (|7.7| ) most probably refers to a 
company that habitually or normally services BA737, or to a company that will service 
BA737 according to some plan, not to a company that is actually servicing BA737 at the 



present. In contrast, (7.6) most probably refers to a company that is actually servicing 
BA737 at the present, or to a company that is going to service BA737. Therefore, 
the Nlitdb should not treat the two questions as identical, if its responses are to be 
appropriate to the meanings users have in mind. 

(7.6) Which company is servicing flight BA737? 

(7.7) Which company services flight BA737? 

Clifford also does not discuss perfect tenses (present perfect, past perfect, etc.), which 
do not seem to be allowed in his framework. Finally, he employs no aspectual taxonomy 
(this will be discussed in section 7.2.1). 



Following the Montague tradition, Clifford employs an intensional higher order lan- 
guage (called Il s ) to represent the meanings of English questions. There is a set of 
syntactic rules that determine the syntactic structure of each sentence, and a set of 
semantic rules that map syntactic structures to expressions of Il s . For example, ( |7.5[) 
is mapped to the Il s expression of ( [7.8[) . 

(7.8) A*i[[ii < i] A 3y[EMPl(ix) (Peter) A 

MGR'(ix)(y) A y(i x ) = Liz A ASS{Peter, y)]} 
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Roughly speaking, (7.8) has the following meaning. EM -P*(ii) (Peter) means that 
Peter must be an employee at the time-point i\. MGR' (ii)(y) means that y must be 
a partial function from time-points to managers (an intension in Montague semantics 
terminology) which is defined for (at least) the time-point i\. ASA(Peter, y) requires y 
to represent the history of Peter's managers (i.e. the value y(i\) of y at each time-point 
i\ must be the manager of Peter at that time-point). The y(i\) = Liz requires the 
manager of Peter at i\ to be Liz. Finally, i is the present time-point, and i\ < i means 
that i\ must precede i. ( [7~8| ) requires all time-points i\ to be reported, such that i\ 
precedes the present time-point, Peter is an employee at i\, and Peter's manager at i\ 
is Liz. 



The following (from flClifford 90j ) is a relation in Clifford's database model (called 
Hrdm - Historical Relational Database Model). 



emprel 



EMP MGR 



DEPT 



SAL 



lifespan 



Peter 



52 
S3 



Elsie 
Liz 



52 — > Hardware 

53 — > Linen 



52 
S3 



30K 
3hK 



{5*2, S3} 



Liz 



52 
S3 



Elsie 
Liz 



52 
S3 



Toy 

Hardware 



52 
S3 



35K 
50K 



{5*2, 53} 



Elsie 



51 

52 



Elsie 
Elsie 



51 -> Toy 

52 -> Toy 



51 
52 



50K 
50K 



{51,52} 



The lifespan of each tuple shows the time-points ("states" in Clifford's terminology) 
for which the tuple carries information. In Hrdm, attribute values are not necessar- 
ily atomic. They can also be sets of time-point denoting symbols (as in the case of 
lifespan), or partial functions from time-point denoting symbols to atomic values. The 
relation above means that at the time-point S*2 the manager of Peter was Elsie, and 
that at S3 the manager of Peter was Liz. Hrdm uses additional time-stamps to cope 
with schema-evolution (section 1.4). I do not discuss these here. 



Clifford shows how the semantics of Il s expressions can be defined in terms of an 
Hrdm database (e.g. how the semantics of ( [7.8| ) can be defined in terms of information 
in emprel). He also defines an algebra for Hrdm, similar to the relational algebra of the 
traditional relational model [ Ullman "8^1 . (Relational algebra is a theoretical database 
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query language. Most Dbmss do not support it directly. Dbms users typically specify 
their requests in more user-friendly languages, like Sql. Dbmss, however, often use 
relational algebra internally, to represent operations that need to be carried out to 
satisfy the users' requests.) The answer to ( |7.5| ) can be found using (|7.9|), which is an 
expression in Clifford's algebra. 

(7.9) uj(a- WHEN E MP=Peter,MGR=Liz(emprel)) 



(J-W H EN em p=Peter,MGR=Liz{efnprel) generates a single-tuple relation (shown below 
as emprell) that carries the information of Peter's tuple from emprel, restricted to 
when his manager was Liz. The u operator returns a set of time-point denoting sym- 
bols, that represents all the time-points for which there is information in the relation- 
argument of lo. In our example, Q7.9D returns {S3}. 



emprel2 


BMP 


MGR. 


DEPT 


SAL 


lifespan 


Peter 


[S3 -> Liz] 


[S3 — > Linen] 


[S3 -> 35K] 


{S3} 



Clifford outlines an algorithm for mapping Il s expressions to appropriate algebraic 
expressions (e.g. mapping (|7.8|) to ( |7.S|) ; see p. 170 of [ Clifford 90|| ). The description of 
this algorithm, however, is very sketchy and informal. 



According to Clifford ( jClifford 87af| , p.16), a parser for his version of the Ptq grammar 
(that presumably also maps English questions to Il s expressions) has been developed. 
Clifford, however, does not provide any information on whether or not a translator 
from Il s to his algebra was ever implemented (as noted above, this mapping is not 
fully defined), and there is no indication that Clifford's framework was ever used to 
implement an actual Nlitdb. 



7.1.2 Bruce 



Bruce's CHRONOS Bruce 72 1 is probably the first natural language question-answering 
system that attempted to address specifically time-related issues. Chronos is not 
really an interface to a stand-alone database system. When invoked, it has no infor- 
mation about the world. The user "teaches" Chronos various facts (using statements 
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like (7.10) and ( [7.11 )), which are stored internally as expressions of a Lisp-like repre- 



sentation language. Questions about the stored facts can then be asked (e.g. ( 7. 12] ), 



(7.10) The American war for independence began in 1775. 

(7.11) The articles of confederation period was from 1777 to 1789. 

(7.12) Does the American war for independence coincide with the time from 1775 
to 1781? 

(7.13) Did the time of the American war for independence overlap the articles of 
confederation period? 

Bruce defines formally a model of time, and explores how relations between time- 
segments of that model can represent the semantics of some English temporal mecha- 
nisms (mainly verb tenses). Bruce's time- model and temporal relations seem to under- 
lie Chronos' Lisp-like representation language. Bruce, however, provides no informa- 
tion about the representation language itself. With the exception of verb tenses, there 
is very little information on the linguistic coverage of the system and the linguistic as- 
sumptions on which the system is based, and scarcely any information on the mapping 
from English to representation language. (The discussion in [ Bruce 72| suggests that 



the latter mapping may be based on simplistic pattern-matching techniques.) Finally, 
Bruce does not discuss exactly how the stored facts are used to answer questions like 
(17T2]) and (fflaj) . 



7.1.3 De, Pan, and Whinston 



De, Pan, and Whinston [ De et al. 85 ] [ De et al. 87 | describe a question-answering sys- 



tem that can handle a fragment of English questions involving time. The "temporal 
database" in this case is a rather ad hoc collection of facts and inference rules (that 
can be used to infer new information from the facts), rather than a principled database 
built on a well-defined database model. Both the grammar of the linguistic processor 
and the facts and rules of the database are specified in "equational logic" (a kind of 
logic-programming language). There is no clear intermediate representation language, 
and it is very difficult to distinguish the part of the system that is responsible for 
the linguistic processing from the part of the system that is responsible for retrieving 
information from the "database". De et al. consider this an advantage, but it clearly 
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sacrifices modularity and portability. For example, it is very hard to see which parts 
of the software would have to be modified if the natural language processor were to be 
used with a commercial Dbms. 

The system of De et al. does not seem to be based on any clear linguistic analysis. 



There is also very little information in |De et al. 85] and [De et al. 87] on exactly which 



temporal linguistic mechanisms are supported, and which semantics are assigned to 
these mechanisms. Furthermore, no aspectual classes are used (see related comments 
in section 7.2.1]) . 



7.1.4 Moens 

Moens' work on temporal linguistic phenomena Moens 87] [Moens & Steedman 88 



has been highly influential in the area of tense and aspect theories (some ideas from 
Moens' work were mentioned in chapter ||). In the last part of jMoens 87f [ (see also 



| Moens 8q [), Moens develops a simplistic Nlitdb. This has a very limited linguistic 



coverage, and is mainly intended to illustrate Moens' tense and aspect theory, rather 
than to constitute a detailed exploration of issues related to Nlitdbs. 

As in the case of Bruce and De et al., Moens' "database" is not a stand-alone system 
built according to an established (e.g. relational) database model. Instead, it is a 
collection of Prolog facts of particular forms, that record information according to 
an idiosyncratic and unclearly defined database model. Apart from purely temporal 
information (that shows when various events took place), Moens' database model also 
stores information about episodes. According to Moens, an episode is a sequence of 
"contingently" related events. Moens uses the term "contingency" in a rather vague 
manner: in some cases it denotes a consequence relation (event A was a consequence 
of event B); in other cases it is used to refer to events that constitute steps towards 
the satisfaction of a common goal. The intention is, for example, to be able to store 
an event where John writes chapter 1 of his thesis together with an event where John 
writes chapter 2 of his thesis as constituting parts of an episode where John writes his 
thesis. Some episodes may be parts of larger episodes (e.g. the episode where John 
writes his thesis could be part of a larger episode where John earns his PhD). 

Moens claims that episodic information of this kind is necessary if certain time-related 
linguistic mechanisms (e.g. "when ... " clauses, present perfect) are to be handled 
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appropriately. Although I agree that episodic information seems to play an important 
role in how people perceive temporal information, it is often difficult to see how Moens' 
episodic information (especially when events in an episode are linked with consequence 



relations) can be used in a practical Nlitdb (e.g. in section 2.5.4, I discussed common 
claims that the English present perfect involves a consequence relation, and I explained 
why an analysis of the present perfect that posits a consequence relation is impracti- 
cal in Nlitdbs). By assuming that the database contains episodic information, one 
also moves away from current proposals in temporal databases, that do not consider 
information of this kind. For these reasons, I chose not to assume that the database 
provides episodic information. As was demonstrated in the previous chapters, even 
in the absence of such information reasonable responses can be generated in a large 
number of cases. 

Moen's database model is also interesting in that it provides some support for imprecise 
temporal information. One may know, for example, that two events A and B occurred, 
and that B was a consequence of A, without knowing the precise times where A and 
B occurred. Information of this kind can be stored in Moens' database, because in 
his model events are not necessarily associated with times. One can store events A 
and B as a sequence of contingently related events (here contingency would have its 
consequence meaning) without assigning them specific times. (If, however, there is no 
contingency relation between the two events and their exact times are unknown, Moens' 
model does not allow the relative order of A and B to be stored.) Although there has 
been research on imprecise temporal information in databases (e.g. |Brusoni et al. i 



[Koubarakis 95]), most of the work on temporal databases assumes that events are 
assigned specific times. To remain compatible with this work, I adopted the same 
assumption. 

Moens' system uses a subset of Prolog as its meaning representation language. En- 
glish questions are translated into expressions of this subset using a Dcg grammar 
[Pereira & Warren 8C], and there are Prolog rules that evaluate the resulting expres- 



sions against the database. Moens provides no information about the Dcg grammar. 
Also, the definition of the meaning representation language is unclear. It is difficult to 
see exactly which Prolog expressions are part of the representation language, and the 
semantics of the language is defined in a rather informal way (by listing Prolog code 
that evaluates some of the possible expressions of the representation language against 
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the database). 



7.1.5 Spenceley 



Spenceley [ [Spenceley 89| developed a version of the Masque natural language front- 
end [ Auxerre fc Inder 86| ] that can cope with certain kinds of imperatives and temporal 



questions. The front-end was used to interface to a Prolog database that modelled a 



blocks- world similar to that of Winograd's Shrdlu [Winograd 73]. The dialogue in 



( 7.14 ) - ( |7.19| ) illustrates the capabilities of Spenceley's system. The user can type 



imperatives, like (7.14) and ( [7.15 ), that cause the database to be updated to reflect 



the new state of the blocks-world. At any point, questions like (7.16) and (7.1 8|) can 



be issued, to ask about previous actions or about the current state of the world. 

(7.14) Take Cubel. 

(7.15) Put Cubel on Cube2. 

(7.16) What was put on Cube2? 

(7.17) Cubel. 

(7.18) Is Cube2 on Cubel? 

(7.19) No. 

A simplistic aspectual taxonomy is adopted, that distinguishes between states and 
actions (the latter containing Vendler's activities, accomplishments, and achievements; 
see section |2.2|) . The linguistic coverage is severely restricted. For example, the user 
can ask about past actions and present states (e.g. (|7.16[) , (|7.18|) ), but not about past 
states ( (|7.20 ) is rejected). Only "while ... ", "before ... ", and "after ..." subordinate 



clauses can be used to specify past times, and subordinate clauses can refer only to 
actions, not states (e.g. ( |7.21 ) is allowed, but ( 7.22| ) is not). Temporal adverbials, like 



"at 5:00pm" in ( |7.23|) , are not supported. Spenceley also attempts to provide some 
support for tense anaphora (section 2.12| ), but her tense anaphora mechanism is very 



rudimentary. 

(7.20) x Where was Cubel? 

(7.21) What was taken before Cubel was put on Cube2? 

(7.22) x What was taken before Cubel was on Cube2? 
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(7.23) x What was taken at 5:00pm? 



The English requests are parsed using an "extraposition grammar" jPereira 8l | , and 



they are translated into a subset of Prolog that acts as a meaning representation 
language.0 The resulting Prolog expressions are then executed by the Prolog interpreter 
to update the database or to retrieve the requested information. The "database" is 
a collection of ad hoc Prolog facts (and in that respect similar to the "databases" of 
Bruce, De et al., and Moens). It stores information about past actions, but not states 
(this is probably why questions like (|7.22j ) are not allowed). Also, the database records 
temporal relations between actions (which action followed which action, which action 
happened during some other action), but not the specific times where the actions 
happened. Hence, there is no information in the database to answer questions like 
(|7T2l ), that require the specific times where the actions happened to be known. 

7.1.6 Brown 



Brown | Brown 94 ] describes a question-answering system that can handle some tem- 
poral linguistic phenomena. As in Bruce's system, the user first "teaches" the system 
various facts (e.g. "Pedro is beating Chiquita."), and he/she can then ask questions 
about these facts (e.g. "Is he beating her?"). Brown's system is interesting in that it 
is based on Discourse Representation Theory (Drt), a theory in which tense and as- 



pect have received particular attention | Kamp fc Reyle 93 ]. Brown's system, however, 



seems to implement the tense and aspect mechanisms of Drt to a very limited ex- 
tent. Brown shows only how simple present, simple past, present continuous, and past 
continuous verb forms can be handled. Other tenses, temporal adverbials, temporal 
subordinate clauses, etc. do not seem to be supported. 

Brown's system transforms the English sentences into Drt discourse representation 



structures, using a grammar written in an extended Dec version | Covington 



Brown provides very little information about this grammar. The relation of Brown's 



grammar to that sketched in [Kamp & Reyle 95] is also unclear. The discourse repre- 



sentation structures are then translated into Prolog facts (this turns out to be relatively 
straight-forward). As in Moens' and Spenceley's systems, the "database" is a collection 
of Prolog facts, rather than a principled stand-alone system. 

2 The syntax and semantics of a similar Prolog subset, that is used as the m eaning representation 



language of another version of MASQUE, are defined in [ Androutsopoulos 92] 
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7.1.7 Other related work 



Hafner |Hafner 85] considers the inability of existing Nlidbs to handle questions in- 
volving time a major weakness. Observing that there is no consensus among database 
researchers on how the notion of time should be supported in databases (this was true 



when | Hafner 85 ] was written), Hafner concludes that Nlidb designers who wish their 
systems to handle questions involving time cannot look to the underlying DBMS for 
special temporal support. She therefore proposes a temporal reasoning model (con- 
sisting of a temporal ontology, a Prolog-like representation language, and inference 
rules written in Prolog), intended to be incorporated into a hypothetical Nlidb to 
compensate for the lack of temporal support from the Dbms. Hafner, however, does 
not describe exactly how her reasoning model would be embedded into a Nlidb (e.g. 
how the semantics of verb tenses, temporal adverbials, etc. could be captured in her 
representation language, how English questions could be translated systematically into 
her representation language, and exactly how her inference rules would interact with 



the Dbms). Also, although when [ Hafner 85 1 was written it was true that there was no 
consensus among temporal database researchers, and that the Nlidb designer could 
not expect special temporal support from the Dbms, this is (at least to some extent) 
not true at the present. A temporal database query language (Tsql2) that was de- 
signed by a committee comprising most leading temporal database researchers now 



exists, and a prototype DBMS (TimeDB; section 1.2.4 ) that supports Tsql2 has al- 
ready appeared. Instead of including into the Nlitdb a temporal reasoning module 
(as sketched by Hafner), in this thesis I assumed that a DBMS supporting Tsql2 is 
available, and I exploited Tsql2's temporal facilities. 

Mays | Mays 86] defines a modal logic which can be used to reason about possible or 
necessary states of the world (what may or will become true, what was or could have 
been true; see also the discussion on modal questions in section |1.4j ). Mays envisages 
a reasoning module based on his logic that would be used when a Nlidb attempts 
to generate cooperative responses. In ( |7.25| ), for example, the system has offered to 
monitor the database, and to inform the user when Kitty Hawk reaches Norfolk. In 



order to avoid responses like (7.27), the system must be able to reason that the distance 
between the two cities will never change. Mays, however, does not discuss exactly how 
that reasoning module would be embedded into a Nlidb (e.g. how English questions 
would be mapped to expressions of his logic, and how the reasoning module would 
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(7.24) Is the Kitty Hawk in Norfolk? 

(7.25) No, shall I let you know when she is? 

(7.26) Is New York less than 50 miles from Philadelphia? 

(7.27) Mo, shall I let you know when it is? 



Hinrichs [Hinrichs 88 1 proposes methods to address some time-related linguistic phe- 



nomena, reporting on experience from a natural language understanding system that, 
among other things, allows the user to access time-dependent information stored in a 
database. Although Hinrichs' methods are interesting (some of them were discussed in 
section 2.1l| ), fHinrichs 88j ] provides little information on the actual natural language 



understanding system, and essentially no information on the underlying Dbms and 
how the intermediate representation language expressions are evaluated against the 
database. There is also no indication that any aspectual taxonomy is used, and the 
system uses a version of Montague's Ptq grammar (see related comments in section 
7T2T3| below). 



Finally, in CLE (a generic natural language front-end | Alshawi 92j ]) verb tenses intro- 
duce into the logical expressions temporal operators, and variables that are intended 
to represent states or events. The semantics of these operators and variables, how- 
ever, are left undefined. In Clare (roughly speaking, a Nlidb based on Cle; see 
[ Alshawi et al. 9% ) the temporal operators are dropped, and verb tenses are expressed 



using predications over event or state variables. The precise semantic status of these 



variables remains obscure. Both [ Alshawi 92 1 and [ Alshawi et al. 92 1 do not discuss 



temporal linguistic phenomena in any detail. 



7.2 Assessment 



It follows from the discussion in section fO| that previous approaches to Nlitdbs suffer 
from one or more of the following: (i) they ignore important English temporal mech- 
anisms, or assign to them over-simplified semantics (e.g. Clifford, Spenceley, Brown), 
(ii) they lack clearly defined meaning representation languages (e.g. Bruce, De et al., 
Moens), (iii) they do not provide complete descriptions of the mappings from natural 
language to meaning representation language (e.g. Bruce, Moens, Brown), or (iv) from 
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meaning representation language to database language (e.g. Clifford), (v) they adopt 
idiosyncratic and often not well-defined database models or languages (e.g. Bruce, De 
et al., Moens, Spenceley, Brown), (vi) they do not demonstrate that their ideas are 
implementable (e.g. Clifford, Hafner, Mayes). In this section I assess the work of this 
thesis with respect to (i) - (vi), comparing mainly to Clifford's work, which constitutes 
the most significant previous exploration of Nlitdbs. 

7.2.1 English temporal mechanisms and their semantics 

In section |7.1.1| , I criticised Clifford's lack of aspectual taxonomy. It should be clear 
from the discussion in chapter [2] that the distinction between aspectual classes per- 
tains to the semantics of most temporal linguistic mechanisms, and that without an 
aspectual taxonomy important semantic distinctions cannot be captured (e.g. the fact 
that the simple past of a culminating activity verb normally implies that the climax 
was reached, while the simple past of a point, state, or activity verb carries no such 
implication; the fact that an "at ..." adverbial typically has an inchoative or terminal 
meaning with a culminating activity, but an interjacent meaning with a state, etc.) 
The aspectual taxonomy of this thesis allowed me to capture many distinctions of this 
kind, which cannot be accounted for in Clifford's framework. Generally, this thesis ex- 
amined the semantics of English temporal mechanisms at a much more detailed level 
compared to Clifford's work. Particular care was also taken to explain clearly which 
temporal linguistic mechanisms this thesis attempts to support, which simplifications 
were introduced in the semantics of these mechanisms, and which phenomena remain 



to be considered (see table 2J) on page ^ for a summary). This information is difficult 



to obtain in the case of Clifford's work. 

In terms of syntactic coverage of time-related phenomena, the grammar of this thesis is 
similar to Clifford's. Both grammars, for example, support only three kinds of temporal 
subordinate clauses: "while ... ", "before ... ", and "after ... " clauses. Clifford's 
grammar allows simple-future verb forms (these are not supported by the grammar 
of this thesis), but it does not allow progressive or perfect forms (which are partially 
supported by the grammar of this thesis). The two grammars allow similar temporal 
adverbials (e.g. "in 1991", "before 3/5/90", "yesterday"), though there are adverbials 
that are supported by Clifford's grammar but not by the grammar of this thesis (e.g. 
"never", "always"), and adverbials that are supported by the grammar of this thesis 
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but not by Clifford's (e.g. "for five hours", "in two days"). Both grammars support 
yes/no questions, "Who/What /Which ... ?" and "When ... ?" questions, multiple 
interrogatives (e.g. "Who inspected what on 1/1/91?"), and assertions (which are 
treated as yes/no questions). The reader is reminded, however, that Clifford assigns 
to temporal linguistic mechanisms semantics which are typically much shallower than 
the semantics of this thesis. 

Although the framework of this thesis can cope with an interesting set of temporal 
linguistic phenomena, there are still many English temporal mechanisms that are not 
covered (e.g. "since ... " adverbials, "when ..." clauses, tense anaphora). Hence, the 
criticism about previous approaches, that important temporal linguistic mechanisms 
are not supported, applies to the work of this thesis as well. (It also applies to Clifford's 
framework, where most of these mechanisms are also not covered.) I claim, however, 
that the temporal mechanisms that are currently supported are assigned sufficiently 
elaborate semantics, to the extent that the other criticism about previous approaches, 
that they use over-simplified semantics, does not apply to the work of this thesis. I 
hope that further work on the framework of this thesis will extend its coverage of 
temporal phenomena (see section |8.2j below). 

7.2.2 Intermediate representation language 

From the discussion in section [O], it follows that some previous proposals on Nlitdbs 
(e.g. Bruce, De et al., Moens) use meaning representation languages that are not 
clearly defined. (Clifford's work does not suffer from this problem; his Il s language is 
defined rigorously.) This is a severe problem. Without a detailed description of the 
syntax of the representation language, it is very difficult to design a mapping from 
the representation language to a new database language (one may want to use the 
linguistic front-end with a new DBMS that supports another database language), and to 
check that existing mappings to database languages cover all the possible expressions 
of the representation language. Also, without a rigorously defined semantics of the 
representation language, it is difficult to see the exact semantics that the linguistic 
front-end assigns to natural language expressions, and it is impossible to prove formally 
that the mapping from representation language to database language preserves the 
semantics of the representation language expressions. This pitfall was avoided in this 
thesis: both the syntax and the semantics of Top are completely and formally defined 
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7.2.3 Mapping from English to representation language 



In section 7.1, I noted that some previous Nlitdb proposals (e.g. Bruce, Moens) pro- 
vide very little or no information on the mapping from English to meaning represen- 
tation language. (Again, this criticism does not apply to Clifford's work; his mapping 
from English to Il s is well-documented.) In this thesis, this pitfall was avoided: I 
adopted Hpsg, a well-documented and currently widely-used grammar theory, and I 
explained in detail (in chapter ||) all the modifications that were introduced to Hpsg, 
and how Hpsg is used to map from English to Top. I consider the fact that this 
thesis adopts Hpsg to be an improvement over Clifford's framework, which is based 
on Montague's ageing Ptq grammar, and certainly a major improvement over other 
previous Nlitdb proposals (e.g. Bruce, Spenceley, De et al., Moens) that employ ad 
hoc grammars which are not built on any principled grammar theory. 



7.2.4 Mapping from representation language to database language 

As mentioned in section [7. 1 . 1| , Clifford outlines an algorithm for translating from Il s 
(his intermediate representation language) to a version of relational algebra. This 
algorithm, however, is described in a very sketchy manner, and there is no proof that 
the algorithm is correct (i.e. that the generated algebraic expressions preserve the 
semantics of the Il s expressions). In contrast, the Top to Tsql2 mapping of this 
thesis is defined rigorously, and I have proven formally that it generates appropriate 
TSQL2 queries (chapter || and appendix |X|) . 

7.2.5 Temporal database model and language 

Several previous proposals on Nlitdbs (e.g. De et al., Spenceley, Moens) adopt tem- 
poral database models and languages that are idiosyncratic (not based on estab- 
lished database models and languages) and often not well-defined. Although Clifford's 
database model and algebra are well-defined temporal versions of the traditional re- 
lational database model and algebra, they constitute just one of numerous similar 
proposals in temporal databases, and it is unlikely that Dbmss supporting Clifford's 
model and algebra will ever appear. This thesis adopted Tsql2 and its underlying 
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Bcdm model. As already noted, Tsql2 was designed by a committee comprising most 
leading temporal database researchers, and hence it has much better chances of being 
supported by forthcoming temporal Dbmss, or at least of influencing the models and 



languages that these Dbmss will support. As mentioned in section 1.2.4, a prototype 
Dbms for a version of Tsql2 has already appeared. Although I had to introduce some 
modifications to Tsql2 and Bcdm (and hence the database language and model of 
this thesis diverge from the committee's proposal), these modifications are relatively 
few and well-documented (chapter ||). 



7.2.6 Implementation 



As mentioned in section 7.1.1, although a parser for Clifford's Ptq version has been 
implemented, there is no indication that a translator from Il s to his relational algebra 
was ever constructed, or that his framework was ever used to build an actual Nlitdb. 
(Similar comments apply to the work of Hafner and Mays of section |7.1.7j .) In contrast, 
the framework of this thesis was used to implement a prototype Nlitdb. Although 
several modules need to be added to the prototype Nlitdb (section |6.6|) , the existence 
of this prototype constitutes an improvement over Clifford's work. Unfortunately, the 
Nlitdb of this thesis still suffers from the fact that it has never been linked to a Dbms 
(section ^). I hope that this will be achieved in future (see section |8^ below). 



7.3 Summary 

In terms of syntactic coverage of temporal linguistic mechanisms, the framework of this 
thesis is similar to Clifford's. The semantics that Clifford assigns to these mechanisms, 
however, are much shallower than those of this thesis. In both frameworks, there are 
several time-related phenomena that remain to be covered. Unlike some of the previous 
Nlitdb proposals, the intermediate representation language of this thesis (Top) is 
defined rigorously, and the mapping from English to Top is fully documented. Unlike 
Clifford's and other previous proposals, this thesis adopts a temporal database model 
and language (Tsql2) that were designed by a committee comprising most leading 
temporal database researchers, and that are more likely to be supported by (or at least 
influencing) forthcoming temporal Dbmss. The mapping from TOP to TSQL2 is fully 
defined and formally proven. In contrast, Clifford's corresponding mapping is specified 
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in a sketchy way, with no proof of its correctness. Also, unlike Clifford's and other 
previous proposals, the framework of this thesis was used to implement a prototype 
Nlitdb. The implementation of this thesis still suffers from the fact that the prototype 
Nlitdb has not been linked to a Dbms. I hope, however, that this will be achieved in 
future. 



Chapter 8 



Conclusions 



"Times change and we with time? 

8.1 Summary of this thesis 

This thesis has proposed a principled framework for constructing natural language 
interfaces to temporal databases (Nlitdbs). This framework consists of: 

• a formal meaning representation language (Top), used to represent the semantics 
of English questions involving time, 

• an Hpsg version that maps a wide range of English temporal questions to ap- 
propriate Top expressions, 

• a set of translation rules that turn Top expressions into suitable Tsql2 queries. 

The framework of this thesis is principled, in the sense that it is clearly defined and 
based on current ideas from tense and aspect theories, grammar theories, temporal log- 
ics, and temporal databases. To demonstrate that it is also workable, it was employed 
to construct a prototype Nlitdb, implemented using Ale and Prolog. 



Although several issues remain to be addressed (these are discussed in section 1^2 
below), the work of this thesis constitutes an improvement over previous work on 
Nlitdbs, in that: (i) the semantics of English temporal mechanisms are generally ex- 
amined at a more detailed level, (ii) the meaning representation language is completely 
and formally defined, (iii) the mapping from English to meaning representation lan- 
guage is well-documented and based on a widely-used grammar theory, (iv) a temporal 
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database language and model that were designed by a committee comprising most 
leading temporal database researchers are adopted, (v) the mapping from meaning 
representation language to database language is clearly defined and formally proven, 
(vi) it was demonstrated that the theoretical framework of this thesis is implement able, 
by constructing a prototype Nlitdb on which more elaborate systems can be based. 

8.2 Further work 

There are several ways in which the work of this thesis could be extended: 



Extending the linguistic coverage: In section 7.2, I noted that although the 



framework of this thesis can handle an interesting set of temporal linguistic mecha- 
nisms, there are still many time-related linguistic phenomena that are not supported 
(see table 2.9 on page |72|). One could explore how some of these phenomena could 



be handled. The temporal anaphoric phenomena of section p. 12 are among those 
that seem most interesting to investigate: several researchers have examined temporal 
anaphoric phenomena, e.g. [Partee 84], [Hinrichs 86|, |Webber 88|, [ Eberle &: Kasper 89| ], 
and it would be interesting to explore the applicability of their proposals to Nlitdbs. 
A Wizard of Oz experiment could also be carried out to determine which temporal 
phenomena most urgently need to be added to the linguistic coverage, and to collect 
sample questions that could be used as a test suite for Nlitdbs [ King 9(j| . (In a Wizard 
of Oz experiment, users interact through terminals with a person that pretends to be 
a natural language front-end; see flDiaper 



Cooperative responses: In section 1.4, I noted that the framework of this thesis 
provides no mechanism for cooperative responses. It became evident during the work 
of this thesis that such a mechanism is particularly important in Nlitdbs and should 
be added (cases where cooperative responses are needed were encountered in sections 
2.5.2| , |2.54 U , I2.9.2] , |2.10.U |2.10.2|, ETlC| , and To use an example from section 



2.5.2 , (|8.l| ) is assigned a Top formula that requires BA737 to have reached gate 2 for 
the answer to be affirmative. This causes a negative response to be generated if BA737 
was taxiing to gate 2 but never reached it. While a simple negative response is strictly 
speaking correct, it is hardly satisfactory in this case. A more cooperative response 
like (|8.2f) is needed. 
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(8.1) Did BA737 taxi to gate 2? 

(8.2) BA737 was taxiing to gate 2 but never reached it. 
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In other cases, the use of certain English expressions reveals a misunderstanding of 
how situations are modelled in the database and the Nlitdb. In (|8.3| ), for example, 
the "for ... " adverbial shows that the user considers departures to have durations 
(perhaps because he/she considers the boarding part of the departure; see section 
|2.4.2| ). In the airport application, however, departures are treated as instantaneous 
(they include only the time-points where the flights leave the gates), and "to taxi" is 
classified as a point verb. The "for ... " adverbial combines with a point expression, 



which is not allowed in the framework of this thesis (see table 2.4 on page |52j ). This 
causes ( |8.3[ ) to be rejected without any explanation to the user. It would be better if 
a message like (8.4) could be generated. 



(8.3) Which flight was departing for twenty minutes? 

(8.4) Departures of flights are modelled as instantaneous. 



Paraphrases: As explained in section 3.2, a mechanism is needed to generate English 



paraphrases of possible readings in cases where the Nlitdb understands a question to 
be ambiguous. 



Optimising the TSQL2 queries: As discussed in section |5.12j , there are ways in 
which the generated Tsql2 queries could be optimised before submitting them to the 
Dbms. One could examine exactly how these optimisations would be carried out. 



Additional modules in the prototype NLITDB: Section p.6| identified several 
modules that would have to be added to the prototype Nlitdb if this were to be 
used in real-life applications: a preprocessor, modules to handle quantifier scoping and 
anaphora resolution, an equivalential translator, and a response generator. Adding a 
preprocessor and a simplistic response generator (as described at the beginning of sec- 



tion 5.6.5| ) should be easy, though developing a response generator that would produce 



cooperative responses is more complicated (see the discussion above and section 6.6.5 ). 
It should also be possible to add an equivalential translator without introducing major 
revisions in the work of this thesis. In contrast, adding modules to handle quantifier 
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scoping and anaphora requires extending first the theoretical framework of this thesis: 
one has to modify Top to represent universal quantification, unresolved quantifiers, 
and unresolved anaphoric expressions (sections |6.6.2 and |6.6.3j ) , and to decide how to 



determine the scopes or referents of unresolved quantifiers and anaphoric expressions. 



Linking to a DBMS: As explained in sections 1.2.4 and 1.3, a prototype Dbms 
(TimeDb) that supports a version of Tsql2 was released recently, but the prototype 
Nlitdb of this thesis has not been linked to that system (or any other Dbms). Ob- 
viously, it would be particularly interesting to connect the Nlitdb of this thesis to 
TimeDb. This requires bridging the differences between the versions of Tsql2 that 
the two systems adopt (section |1.3j) . 



Embedding ideas from this thesis into existing NLIDBs: Finally, one could 
explore if ideas from this thesis can be used in existing natural language front-ends. 
In section |7.1.7| , for example, I noted that Cle's formulae contain temporal operators 
whose semantics are undefined. One could examine if Top operators (whose semantics 
are formally defined) could be used instead. Ideas from the Top to Tsql2 mapping of 
chapter [| could then be employed to translate the resulting Cle formulae into Tsql2. 
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Appendix A 



Translation rules and proofs for 
chapter §\ 



A.l Introduction 



This appendix contains the full set of Top to TSQL2 translation rules, and the proofs 
of theorems [iTl] and |5.2| (see chapter ||). As noted in section 5.1C, the translation rules 
specify the values of trans(6, A) for every Top formula 6 and Tsql2 value expression 
A. (In practice, A always represents a period.) There are two kinds of translation 
rules: (a) base (non-recursive) rules that define the values of trans(6, A) when 6 is an 
atomic formula or a formula of the form CWm[7r(Ti, . . . ,r n )]; and (b) recursive rules 
that define the values of trans(6, A) in all other cases (where 6 is non-atomic), by 
recursively calling other translation rules to translate subformulae of 6. 



Section |A.3| lists the translation rules for yes/no formulae 6. According to section 
5.1C, these rules have to satisfy theorem 5.2. Theorem |5.2| is proven by induction on 
the syntactic complexity of 6. I first prove that theorem |5.2| holds if 6 is a predicate 
7r(Tij . . . , r n ) or a formula of the form Culm[ir(Ti, . . . , r n )]. For all other yes/no formulae 
cj), 4> is non-atomic. In those cases, I prove that theorem |5.2| holds if it holds for all the 



subformulae of 6. Each translation rule of section A. 3 is followed by the corresponding 



part of theorem 5.2's proof. For example, the translation rule that specifies the values 
of trans(4>, A) when 6 = Pres[6'\ is followed by the proof that theorem |5.2| holds for 
6 = Pres\d'\ if it holds for 6 = 6'. 



Section A.4j lists the translation rules for wh-formulae. These rules have to satisfy 
theorem |5.1| (see section 5 . 1 0| ) . There are two translation rules for wh-formulae, that 
correspond to the case where 6 £ WHFORMSi or 6 £ WHFORMS2 (see section 
3.2). Each rule is followed by a proof that theorem 5.1 holds if 6 G WHFORMSi or 
6 G WHFORMS2 respectively. 

In the rest of this appendix, I use the term Tsql2 expression to refer to any piece of 
Tsql2 code. In contrast, the term Tsql2 value expression is used to refer to a piece 
of Tsql2 code that normally evaluates to an element of D (section |5.4| ). 
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A.2 Lemmata 



The following lemmata will prove useful in the following sections. 



Lemma A.l If £ is a Tsql2 expression, and £1, £2, £3, ■ ■ ■ , £k are substrings of £, and 
any free column reference in £ is situated within £1 or £2 or £3 or . . . or then: 

FCN(£) C FCNfa) U FCN{&) U FCNfa) U . . . U FCN(£ k ) 



Proof: Let us assume that a £ FCN(£), i.e. a is a correlation name that has a free 
column reference £ in £. We need to show that a G FCN(£x) U . . . U FCN(£k)- 

Since £ is a free column reference in £, according to the hypothesis for some i G 
{1,2,3,... , fc}, C is situated within £j. There is no binding context for £ in £j, because 
since £j is part of £, if there is a binding context for £ in £j, then there is also a (the 
same) binding context for £ in £; this would imply that £ is not a free column reference 
in £, which is against the hypothesis. 

Since there is no binding context for £ in £j, by definition £ is a free column reference 
in £j. This means that a has a free column reference in £j, i.e. a G FCN(£i). Then, 
a G FCN(£i) U . . . U FCN{t k ). Q.E.D. 

Lemma A. 2 If si G PTS 1 , 5 G G, and for every i G {1,2,3, ... ,n}, n G TERMS, 
Vi G D, and (A.l) - (A.2) hold, then (A.3) also holds. 

(A.l) If n G VARS, then g(n) = f D ( Vi ) 

(A.2) If Ti G CO/VS, then v t = h cons (st)(Ti) 

(A.3) IhH^ 8 ^ = f D (vi), 



Proof: Since TERMS = CONS U r, G GCWS or 75 G VARS: 



• n G CONS: By the definition of / cons in section U, (A.4) holds. (A.2) and (A.4) 
imply (A. 5). 

(A.4) fcons(st)(Ti) = fD{h C ons{st){Ti)) 

(A. 5) fcons(st)(Ti) = f D (Vi) 

The semantics of Top imply that f cons (st) (r f ) = ||Tj|| M ( s *)' ff . Hence, ||T;|| M ( st ^ = 

• t, G M1R5: By the semantics of Top and (A.l), Wnf' 1 ^^ = gin) = f D (vi). 
Hence, in both cases ||ri|| M(si) ' 9 = /d(^). Q-E.D. 



Lemma A.3 If st G PTS, g G G, </ G G, G YNFORMS, (n,T 2 ,T 3 , . . . ,r n ) = r </> n , 
^l) ^2,^3, • • ■ , £ (A. 6) holds, and for every variable j3 of 0, g(/3) = g'(f3), then 
(A.7) also holds. 

(A.6) h! W =/flW, ••• ,l|rn|| M(st) ' s ' =/dK) 

(A.7) Unll^' 9 = • • • , ||r n || M(st) ' 9 = /d(0 
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Proof: The definition of r (jP implies that for every i G {1, 2, 3, ... , re}, Tj G TERMS. 
Hence, n G CONS or Tj G VAflS: 

• Tj € C0^5: The semantics of Top implies that ||Tj|| M ( s *)'f = / con ,(st)(ri) = 

|| T .||M( S t), s '_ 

• Tj E K4-R5: By the definition of r 0~ l , Tj is a variable in <fi. Then, g{Ti) = cf(jj), 
because g and g' assign the same values to all the variables of <j>. Tj . The 
semantics of Top imply that ||Tj|| M ( s *)' s = g{n) and ||ri|| M ^' ff ' = g'^n). Then, 
since gin) = g'(n), \\n\\ M ^ = 

Hence, for every i G {1, 2, 3, . . . , re}, ||rj|| M ( s ^' 5 ' = ||Tj|| M ( s *)' 9 '. This conclusion and 
(A.6) imply (A.7). Q.E.D. 



Lemma A.4 If g,g' e G, (j) £ YNFORMS, r c/> n 
for every variable [3 of <f>, g(f3) = g'((3). 



(A.« 



\M(st),g 



\M{st),g' 



= (n, . . . ,r n ), and ( A. 8 ) holds, then 

M(st),g _ || ||M(st), S ' 



Proof: 

From the definition of r 4>~ 1 , for any variable (3 of <fi, there is an z G {1, 2, 3, ... , re}, such 
that Tj = /?. According to the semantics of Top: 

(A.9) \\n\\ M ^ = g(n) 

(A.io) \\n\\ M ^' = g '(n) 

(tA7s|) , ( |Ag) , and ( |A.10[) imply that #(Tj) = g'in), i.e. g(^) = g'((3). In other words, 
for every variable [3 of i^', g and 5' assign the same value to f3. Q.E.D. 



Lemma A. 5 If ti,T2,T3, . . . , r n G TERMS, vi,V2,v$, . . . ,v n G D, (A. 11) holds, and 
the mapping g : VARS 1— > OBJS is as in (A.12) (o is a particular element of OBJS, 
chosen arbitrarily), then g G G. 

(A. 11) if i,j G {1,2,3, ... ,re}, z / j, Tj = r,-, 

and Tj, T,- G V/liZS 1 , then vi = Vj 

def J fD(vi), if for some i G {1, 2,3,..., re}, (3 = n 



(A.12) 



o, otherwise 



Proof: g is a function. To show this, I need to prove that for each (3 G VARS, g{(3) 
is uniquely defined. There is only one case where g{(3) may not be uniquely defined: 
there may be two different i\,i2 G {1,2,3,... ,re}, with (3 = Tj x = Tj 2 . In this case 
I need to show that /d(^i) = /u(fi 2 )- I wm show that = Vi 2 , which implies 
/d(%) = fo(vi 2 ). The proof follows: 

If ii,42 £ {1)2,3, . . . ,n}, with /? = n x = Ti 2 G VARS, and z'i 7^ i2, then let i be the 
smaller of i\ and Z2, and j the greater of i\ and Z2- By (A. 11), Vi = Vj. This implies 
that Vi x = Vi 2 . Hence, g{(3) is uniquely defined, and g is a function. Since g also maps 
from l^fiS to OBJS, g G G. Q.E.D. 
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Lemma A.6 If t\,t\,..., t\ x , t?, . . . , t* 2 G TERMS, v\,v\,..., < v 2 n2 G D, 
st G PTS, gx,g 2 G G, (A.13) - (A. 15) hold, and the mapping g : VARS ■-> OBJS is as 
in (A. 16) (o is a particular element of OBJS, chosen arbitrarily), then g G G. 



(A.13) 

(A.14) 
(A.15) 

(A.16) 



if i G {1,2,3 



Fi 



,m}, j G {1,2,3, 
and 

l||Af(st),si = f D (~M,... Ii- J i • t/( - M ' ) -' / 



n 2 }, r^r 2 G 



if, then i£ 



■ni i 



T 



def 



2\\M(st),g 2 



\M(si),g 2 



/D«) 
fD(vL) 



gi((3), if for some i G {1, 2, 3, ... , ni}, /3 = t\ 
52 (/3), if for some j G {1,2,3, . . . ,n 2 }, /3 = r? 
o, otherwise 



Proof: 5 is a function. To show this, I need to prove that for each f3 G VARS, 
g{0) is uniquely defined. There is only one case where g{(3) may not be uniquely 
defined: there may be both an i € {1, 2, 3, ... , n{\ and a j G {1, 2, 3, ... , 712}, with 
(3 = t} = G l/^-R^. In this case, I need to prove that g\((3) = 52 i.e. that 
9~t( T l) = 92( T j)- (9i( T i) an£ i 92( T j) are uniquely defined, because 51,52 £ G, which 
implies that 51,52 are functions.) 

Let us assume that i G {1, 2, 3, ... , ni}, j G {1, 2, 3, ... , 712}, and that (3 = t} = r 2 G 
FAi^S". Since t?-,t? G VARS, the semantics of Top implies that: 

(A.17) 51 (r/) = || T i||*'(«t).si 

(A.18) 52 ( r |) = || T 2||M(^ 2 

(A.17) and (A.14) imply (A.19), while (A.18) and (A.15) imply (A. 20). 

(A.19) 9i(rl) = f D (vj) 

(A.20) g 2 {rf) = f D (v]) 

Since i G {1, 2, 3, . . . , m}, j G {1, 2, 3, . . . , n 2 }, t/,t 2 G VARS, and r/ = r 2 , (A.13) 
implies that w| = v?. This, along with (A.19) and (A.20) imply that 51 (r/) = 52 (r 2 ). 
Hence, 5(/?) is uniquely defined, and 5 is a function. Since 5 also maps from VARS to 
OBJS, 5 G G. Q.E.D. 



Lemma A.7 If « G Dp, G KAKS, 5' G G, and 5 = (5'K (v) , then 5 G G. 



Proof: To show that g G G, it is enough to show that Jd(v) G OBJS. /d(v) £ 
PERIODS, because t; G L> P . Since PERIODS C 0575 (sections and|J), / D (u) G 
OBJS. Hence, 5 G G. Q.E.D. 



A. 3 Translation rules for yes/no formulae and proof of 
theorem [572 
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A. 3.1 7r(ri,...,r n ) 
Translation rule 



If 7r G PFUNS, Tx,...,r n G TERMS, and A is a Tsql2 value expression, then: 

trans(ir(Ti, ... ,T n ),X) = 
(SELECT DISTINCT a.l, a.2, ... , a.n 
VALID VALID (a) 

FROM (^ /ms (7r,n))(SUBPERI0D) AS a 
WHERE . . . 
AND ... 

AND . . . 

AND A CONTAINS VALID (a)) 

Each time the translation rule is used, a is a new correlation name, obtained by calling 
the correlation names generator after A has been supplied. The "... "s in the WHERE 
clause correspond to all the strings in Si U S2, where: 

S 1 = {"a.i = h' cons {TiY I i G {1,2,3,... ,n} and n G CONS} 
S2 = {"a.z = a.j" I i,j G {1,2,3, . . . ,n}, i < j, n = tj, and Ti,Tj G VARS} 



Proof that theorem 5/2 holds for <fi = 7t(ti, . . . , r n ) 



I assume that it G PFUNS and n, . . . ,r n G TERMS. By the syntax of Top, this 
implies that 7t(ti, . . . ,r n ) G YNFORMS . I also assume that st G PTS, A is a Tsql2 
value expression, g db G G db , eval(st, A, g db ) G Dp, and S = trans(n(Ti, . . . , r n ), A). By 
the definition of r . . . n , r 7r(ri, . . . , r„) n = (n, . . . , T n ). I need to show that the three 
clauses of theorem |5l3 hold. 



Proof of clause 1 

The a.l, a.2, . . . , a.n in the SELECT clause of £ and the VALID (a) in the VALID and 
WHERE clauses are not free column references in E, because S is a binding context for 
all of them. For the same reason, any column references of the form a.i and a.j 
(deriving from Si and S2) i n the WHERE clause are not free column references in E. The 
only remaining parts of £ where column references (and hence free column references) 
may occur are the h' p j uns (-K,n) of the FROM clause, the A of the WHERE clause, and the 

Kons( T h), Kon S ( T i 2 ), Kons( T i 3 )i ••• , Kons( T i m ) of the WHERE clause {h' cons {T h ) , ... , 

h'cons( T im) derive from Si) r^, . . . , Tj m are all the Top constants among n, . . . , r n ). By 
lemma |A.l| , this implies that: 

(A.21) FCW(£) C FCN(h' pfuns (7r,n)) U FCN(X) U 

FCN(h' cons {r n )) U...U FCN(h' cons (r lm )) 



According to section 5.9, for every k G CONS, FCN(h' cons (n)) = 0. Since Tj m G 



CONS (see comments above), FCW(/i^ ons (rj 1 )) = 0, ... , FCN (h' cons (T im )) = 0. Ac- 
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cording to section [5.9| , it is also true that for every it G PFUNS and n£ {1,2,3,... }, 
FCN{ti pfuns (-jr,n)) = 0. Hence, (A.21) becomes FCN(Z) C FCN(X). Clause 1 has 
been proven. 



Proof of clause 2 



According to section |5l| h' p j uns (n, n) is a Tsql2 SELECT statement, FCN(h' p j uns (it, n)) = 
0, and eval(st,h' p f um (TT,n)) G NVRELp(n). The a of S ranges over the tuples of the 
relation subperiod(eval(st, h' p f uns (ir,n))). From the definition of subperiod (section 
5.3. 2| ), it is easy to see that subperiod(eval(st, h' p f uns (n,ri))) is a valid-time relation 
that has the same number of explicit attributes as eval(st, h' p f uns (Tt,n)), i.e. n. Hence, 
a ranges over tuples (v±, . . . , v n ; vt) G subperiod{eval(st, h' p j uns (it, n))). 

The a of £ is generated by calling the correlation names generator after A has been 
supplied. Hence, a cannot appear in A. Since a does not appear in A, for every tuple 
(vi, . . .,v n ;v t ): 

(A.22) eval(st,X,(g db )l 1 _ Vn . Vt) ) = eval(st, A, g dh ) 



The reader should now be able to see from the translation rule that (|A.23j ) holds. Intu- 
itively, (vi,..., v n ; vt) is the tuple of subperiod(eval(st, h' p j uris (iT, n))) to which a refers. 
The last line of ( A.23j ) corresponds to the CONTAINS constraint in the WHERE clause of X. 



I should have used eval(st, A, (g db )? vi Vn - Vt \) instead of eval(st, X,g db ), to capture the 
fact that if there is any free column reference of a in A, this has to be taken to refer to the 



(vi, . . . , v n ;vt) tuple to which a refers. By (A.22), however, eval(st, A, (g db ! ' 



(vi,...,v n ;vt) / 

is the same as eval(st, A, g dh ). The second and third lines of ( A. 23 ) correspond to the 
restrictions of S\ and S2 (see also the comments about the equality predicate in section 
5.3.6| ). I do not include in the arguments of eval(st, h' cons (rj)) the assignment to the cor- 



relation names, because according to section [5j|, for r, G CONS, FCN (h' cons (Ti)) 

(A. 23) eval(st,T., g db ) = {(vi, . . . ,v n ;v t ) G subperiod(eval(st, h p j uns (ir,n))) \ 
if i G {1, 2, 3, ... , n} and Tj G CONS, then Vi = eval(st, h' cons (rj)), 
if i, j G {1, 2, 3, ... , n}, i < j, n = Tj, and Tj, Tj G VARS, then Vi = Vj, 
fD(v t )tf D (eval(st,\,g db ))} 



According to the definition of subperiod (section p.3.2[ ), (A. 24) holds iff (A.25) holds 
for some v' t . By the definition of h p f uns of section |5.9| , (A.25) is in turn equivalent to 
(A.26). 

(A. 24) (vi, . . . , v n ; vt) G subperiod(eval(st, h' p j uns (ir,n))) 

(A.25) (vi,...,v n ;v' t ) G eval(st, h' pfuns (7T,n)) and /d(^) C /dK) 

(A.26) (vi,. . . ,v n ;v' t ) G h p f uns (st)(TT,n) and f D {vt) E /r>K) 

Using the fact that (A. 24) holds iff (A.26) holds for some v' t , and the fact that for 
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Tj G CONS, eval(st, h' cons (Ti)) = h cons (st)(Ti) (section^), ( [A. 23 ) becomes: 

(A. 27) eval(st, £, g db ) = {(ui,... ; v n ; v t ) | for some v' t , 
(vt, . . . ,v n ;v' t ) G h pfuns (st)(ir,n), f D (v t ) Q /dK) 
if i G {1, 2, 3, ... , n} and Tj G CONS, then Uj = h cons (st)(Ti), 
if i, j G {1, 2, 3, ... , n}, z < j, t; l = Tj, and Tj, G l//li?S', then Vi = Vj, 
fD(vt)QfD(eval(st,X,g db ))} 

For every (v\, . . . ,v n ;v t ) G eval(st,T,, g db ), the /d(^) E foWt) i n the second line of 



( A. 27 ) implies that /d( u *) G PERIODS , which in turn implies that vt G -Dp. That 



is, all the time-stamps of eval(st,T>, g db ) are elements of Dp. ( |A.27 ) also implies that 



eval(st,Y l ,g db ) is a valid-time relation of n explicit attributes. Hence, eval{st, H,g ) 
VRELp(n), and clause 2 has been proven. 



Proof of clause 3 

Using the definition of ||tt(ti, . . . , Tn )\\M{st),st,et,lt,g ( sec tion |J), clause 3 becomes: 

(vi, . . . ,v n ;v t ) G eval(st,T,,g db ) iff for some g and p mx i: 
(A.28) g G G 

(A.29) Hnf ^ = . . . , ||T n || M ^ = f D (v n ) 

(A.30) G /^(si)(7r, nXHnH^K . . . , ||T n || M ^ ) 

(A.31) fD(v t )QPmxl 

(A.32) / D (t*) C/ D (et;aZ(^,A,<? dfe )) 



I first show that the forward direction of clause 3 holds. I assume that (v\, . . . ,v n ;vt) G 
eval(st, Tj, g db ). Then, ( |A.27| ) implies that for some v' t : 



(A.33) 
(A.34) 
(A.35) 
(A.36) 
(A.37) 



(v 1 ,... ,v n ;v' t ) G h pfuns (st)(7r,n) 
f D (v t ) □ f D (v' t ) 

if i G {1, 2, 3, ... , n}, and Tj G CONS, then Uj 



hcons v i. 



if i, j G {1, 2, 3, ... , n}, i < j, Tj = Tj, and Tj, Tj G K4i?S', then 



Vi 



V; 



fD(vt)Cf D (eval(st,X,g db )) 



To prove the forward direction of clause 3, I must prove that for some q and. Pmxh 
(|aJD - ( WM ) hold. I define the mapping g : VARS i-» as follows: 



<fe/ j fn{vi), if for some i G {1, 2, 3, ... , n}, (3 = Tj 
j o, otherwise 



where o is a particular element of OBJS, chosen arbitrarily. (A.28) follows from lemma 
A.5|, t he definition of g, and ( [A.36 ). I set p m xi as in _(Q8|) , and show that (|Al29D - 
( |A.32j ) also hold. ( |A.29|) follows from lemma A.2 , ( A.35| ), and the definition of g. 



(A.38) 



Pmxi 



f D (v' t ) 
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I now pr ove (|A.30 ). ( |A.29| ) (proven above) implies ( |A.3S|) . (A. 33) and ( |A.39|) i mply 
(|A.4C| ). (|A.4C| ) and the definition of f p f uns of section |5.8| imply ( A.41|) . flA.41|) and 

flO§ imply (ED- 



(A.39) 
(A.40) 
(A.41) 



-foil 

\M(st),g* 



(f D \\\ri\\ M{st) ' 9 ),---J D H 
f D (vi)ef pfum (st)(n,n)(\\r4 M ^, 



- \\M(st),g) 

;v' t ) G hp f uns (st) (it, n) 



(|A.31| ) follows from ( |A.34[) and (|A.38|) . I now prove (|A.32j) . (|A.34|) implies that 
fo(vt) 6 PERIODS. From the hypothesis, eval(st, X, g db ) G Dp, which implies that 
fD(eval(st, X, g db )) is a period or the empty set. f£)(eval(st, A, <7 )) cannot be the 



empty set, because according to ( A. 37 ) fo(vt) (which is a period and therefore a non- 
empty set) is a subset of f D (eval(st,X,g db )). Hence, f D (eval(st, X, g db )) G PERIODS. 
( A.37| ) and the fact that both fo(vt) and fo(eval(st, A, g rfb )) are periods imply ( A. 32 ). 
The forward direction of clause 3 has been proven. 



I now prove the backwards direction of clause 3. I assume that ( |A~28D - (EH) hold for 
some g and p mx i- I need to show that (v\, . . . ,v n ;vt) G eval(st, S, According to 
( A.27 ), it is enough to prove that for some ( A.33|) - ( A.37| ) hold. I set v' t = fp (Pmxl): 
which implies (A. 42). 



(A.42) 



Pmxl 



SdH) 



I first prove ( A.33| ). ( A.30| ), (A.42), and the definition of f p f uns of section 
(A.43). ( |A.29|) implies (A.44). (A.43) and (A.44) imply (|A.33|) . 



(A.43) 
(A.44) 



(fEHWnW^ 9 ), • • • JEH\\rn\\ M{st) ' 9 ); <) e ^(^n) 



/D 1 (lln|| M(si) ' 9 ) = ^i,...,/ D 1 



|itf(st),<n 



imply 



(EH ) follows from ( EUD and (A.42). I now prove ( EH )- If « 6 {1, 2,3, ... , n} and 
Tj G CONS, the semantics of Top implies (A. 45), and QA.29] ) implies (A. 46). 



(A.45) 
(A.46) 



r .\\M{st),g 



fD(Vi) 



(A.45) and (A.46) imply that fo(vi) = /consist) (Ti), which in turn implies (A. 47). 



The definition of f CO ns of section 5.8 implies (A. 48). (A. 47) and (A. 48) imply that 
Vi = h cons (st)(Ti). ( A.351 ) has been proven. 



(A.47) 
(A.48) 



Vi = f D 1 (fcons(st)(T i )) 
fcons(st)(Ti) = f Dih C onsist)iTi)) 



I now prove (EH)- If hj G {1,2,3, ... ,n} and n = tj, then \\n\\ M ^' 9 = \\T j \\ M W> 9 . 
Then, (|A.29|) implies that fo(vi) = foivj), which in turn implies that /^(/dC^O) = 
/d 1 (/d(^j))) he. Vi = Vj. ( A. 36 ) has been proven. ( A. 37 ) follows from ( A. 32 ). The 
backwards direction of clause 3 has been proven. 
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A. 3. 2 Culm[n(Ti, . . . ,r n )] 
Translation rule 

If 7r G PFUNS, ti, . . . , T n G TERMS, and A is a Tsql2 value expression, then: 

def 

trans(Culm[K(Ti, . . . , r n )], A) = 
(SELECT DISTINCT ai-1, a x .2, ... , a x .n 
VALID PERIOD (BEGIN (VALID (ai)) , END (VALID (ai) ) ) 
FROM (h' p j uns {it, n)) (ELEMENT) AS a x , 

(Kuim S (K,n)) AS a 2 
WHERE ai-1 = a 2 .l 
AND ai.2 = a 2 .2 



AND a\.n = a 2 .n 
AND ... 



AND ... 

AND A CONTAINS PERIOD (BEGIN (VALID (a x ) ) , END (VALID (ai) ) ) 

Each time the translation rule is used, ot\ and «2 are two new different correlation 
names, obtained by calling the correlation names generator after A has been supplied. 
The "... "s in the WHERE clause correspond to all the strings in S\ U S2, where Si and 



5*2 are as in section A. 3.1, except that a is now a.\. 



Proof that theorem 5^2 holds for (f> = Culm[ir{ri, . . . , r n )] 



I assume that 7r G PFUNS and n, ...,r„ G TERMS. By the syntax of Top, 
this implies that Cu/m[-7r(ri, . . . , r n )] G YNFORMS . I also assume that st G PTS, 
that A is a Tsql2 value expression, g db G G db , eval(st,\, g db ) G Dp, and that 
S = trans(Culm[ir(Ti, . . . ,r n )], A). By the definition of r . . . n , r Culm[ir(Ti, . . . ,T n )p = 
(ri, . . . , T n ). I need to show that the three clauses of theorem hold. 



Proof of clause 1 

The ai-1, a±.2, . . . , a.\.n in the SELECT clause of E, and the four VALID(ai) in the 
VALID and WHERE clauses are not free column references in E, because S is a binding 
context for all of them. For the same reason, all the column references of the form 
a.i (i G {1,2,3,... ,n}) in the WHERE clause of £ are not free column references in 
E. The only remaining parts of E where column references (and hence free column 
references) may occur are the h' p j uns {i:,n) and the h' culms {i:,n) of the FROM clause, the 
A of the WHERE clause, and the ft^CnO, Kons( T i 2 ), Konsin^), ••• , Kons(n m ) of the 
WHERE clause (h' cons (r^), ... , h' cons (T im ) derive from Si; T ix , . . . ,r im are all the Top 
constants among n, . . . , r ra ). By lemma [A.I , this implies that: 



(A.49) raV(£) C FCN(h' pfuns (n,n)) U FCN(h' culms (n,n)) U 

FCN(X) U FCN(h' cons {r n )) U . . . U FCN {h' cons {r lm )) 
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By section ^ for every k G CONS, FCN(h' cons (n)) = 0. Since T h ,...,r im G CONS 
(see comments above), FCN {h' cona {ji 1 )) = 0, ... , FCN (h' cons (Ti m )) = 0. Accord- 
ing to section |5.9| , it is also true that for every 7r G PFUNS and n G {1,2,3, .. . }, 
F CN(h' pfuns {n,n)) = and FCN(h' culms (Tr,n)) = 0. Hence, (A.49) becomes FCN(E) C 
FCN(X). Clause 1 has been proven. 



Proof of clause 2 



According to section |5.9| , h p f uns (ir,n) and h' culms (iT,n) are Tsql2 SELECT statements, 
^(/^ w (7T,n)) = FCiV(^ ms (7r,n)) = 0, euai^^^n)) G NVREL P {n), and 
eval(st, h' culms (-K,n)) G SREL(n). The ai of £ ranges over the tuples of the relation 
coalesce(eval(st, h' p j uns (TT , n))) (see section |5.2.4| ). From the definition of coalesce, it 
is easy to see that coalesce(eval(st, h' p f uns (ir,n))) is a valid-time relation that has the 
same number of explicit attributes as eval(st, h' p j uns {i[, re)), i.e. n. Hence, a\ ranges 
over tuples (v±, . . . ,v n ;v[) G coalesce(eval(st, h' p j uns {i:,n))). The 02 of £ ranges over 
the tuples of eval(st, h' culms (iT , re)) . Since eval{st,h' culms {i:,n)) G SREL{n), 02 ranges 
over tuples {vi, . . . ,v n ) G eval(st, h' culms (7r,n)). 

The ct\ and 02 of E are generated by calling the correlation names generator after A 
has been supplied. Hence, a\ and 0:2 cannot appear in A. The fact that a\ and 0:2 do 
not appear in A means that for every v\, . . . ,v n G D and every v' t G Dt- 

(A.50) eval^X,^)^ ,...,,„)) = euo ^> A ' 



The reader should now be able to see from the translation rule that ( [A. 51 ) holds 



Intuitively, {v\, . . . ,v n ;v' t ) is the tuple of coalesce(eval(st,h p j: uns (ir,n))) to which a\ 
refers, and (v\, . . . , v n ) is the tuple of eval(st, h' culms (it, n)) to which 0J2 refers. The last 



line of (A. 51) corresponds to the CONTAINS constraint in the WHERE clause of E. I should 



have used eval(st, A, ((g )? v .^/p/^ v \) instead of eval(st, X,g db ), to capture the 
fact that if there are any free column references of a± or 02 in A, these have to be 
taken to refer to the (v\, . . . ,v n ;v' t ) or (vi, . . . , v n ) tuples to which a\ and 02 refer 
respectively. By ( |A.50[ ), however, eval(st, X, ((s'*)^.. .^y))^,.. .,„„)) is the same as 



eval(st, X, g ). The fifth and sixth lines of ( A.51 ) correspond to the restrictions of S 



'1 



and £2 (see also the comments about the equality predicate in section 5.3.6| ). I do not 
include in the arguments of eval(st, h' cons (Ti)) the assignment to the correlation names, 
because according to section |5.9|, for Tj G CONS, FCN {h' cons (Ti)) = 0. 



(A.51) eval(st,T,,g ) = {(v\, . . . ,v n ;v t ) \ for some v' t 



(v 1, . . . , v n ; v' t ) G coalesce(eval(st, h' p f uns (-K,n))), 
(vi,...,v n ) G eval(st,h' culms (7r,n)), 
fD(v t ) = [minpt(f D (v' t )),maxpt(f D (v' t ))], 

if i G {1, 2, 3, . . . , re} and r« G CONS, then V{ = eval(st, h' cons (Ti)), 

if i, j G {1, 2, 3, ... , n}, i < j, n = Tj, and Tj, Tj G VARS, then = fj, 

f D (v t ) ^f D (eval{st,X,g db ))} 
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According to section 0|: 



(A.52) 
(A.53) 
(A.54) 



eval(st,ti pfuns (ir,n)) = h pfuns (st)(ir,n) 
eval(st,h' culms (n,n)) = h cvlms {st)(-K,n) 
for Ti G CONS, eval(st, h' cons (Ti)) = h cons 



Using ( |A.52| ), ( |A.53| ), ( |A.54| ), and the definition of coalesce of section |5.2.4j , ( |A.51| ) 

becomes: 

(A. 55) eval(st, S, g db ) = {(v±, . . . , v n ; vt) | for some v' t , v' t ', 
{v!,... ,v n ;v") G h pfuns (st)(TT,n), 

/dK) = U MO 

(vi,...,v n ;v' t ") G h pfuns (st)(ir,n) 

{vi,...,v n ) G h culms (st)(ir,n), 

f D {v t ) = [minpt(f D (v' t )),maxpt(f D (v' t ))], 

if i € {1, 2, 3, ... ,n} and Tj G CONS, then = h cons (st)(Ti), 



if i, j G {1,2,3, . . . ,n}, z < j, r, 
/D(^)C/ D ( e ^( S t,A,/ b ))} 



r 7 -, and Ti,Tj G V^4i?S', then 



According to ( |A.55 ), for every tuple (v\, . . . , v n ; v t ) G eval(st, X, g db ), there is a v^, such 
that fD(v't) is the union of all the temporal elements /d WO represented by time-stamps 
of tuples (vi, . . . ,v n ;v' t ") G h p f uns (st)(ir,n). {foWt) is n °t the empty set, because by 
the second line of ( |A.55[) , there is at least one tuple (v\, . . . ,v n ;v' t f ) G h p f uns (st)(-7T,n)). 
fo{vt) is the period [minpt(fp>(v' t )), maxpt{f oWt))}- This implies that v t G Dp. We 
have concluded that for every tuple in eval(st, S, g db ), the time-stamp v% is an element 
of Dp. ( A.55j ) also implies that eval(st,T>, g db ) is a valid-time relation of n explicit 
attributes. Hence, eval{st,T,,g db ) G VRELp(n). Clause 2 has been proven. 



Proof of clause 3 

Using the definition of || Culm[n(Ti, . . . , r n )] \\ M i st )M,et,it,g ( sec ti on p,9D , clause 3 be- 
comes: 



(v\, . . . , v n ;vt) G eval(st, H,g ) iff for some g and S: 

(A.56) <? G G 

(A.57) Hnir^ = f D ( Vl ), \\r n \\ M{st) ' g = f D (v n ) 

(A.58) f D {v t ) C / D (em/(si,A,/ 6 )) 
(A.59) n)([|ri||^' fl , • • • , ||r n || M ^) = T 

(A.60) 5/0 

(A. 61) foivt) = [minpt(S),maxpt(S)] 

(A.62) 5= |J p 



I first show that the forward direction of clause 3 holds. I assume that (vi, . . . ,v n ;vt) G 
eval(st,Ti, g db ). I need to show that for some g and S, ( [A.56 ) - ( A.62| ) hold. The 
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assumption that (ui, . . . ,v n ;vt) G eval(st,T,, g db ) and ( A.55| ) imply that for some v' t 



and v": 

(A. 63) (v!,... ,v n ;v") G h pfuns (st)(7r,n) 

(a.64) / D K)= u M«n 

(ui,...,«„;^") g h pfuns (st)(ir,n) 
(A.65) (Ul , . . . , U „) G /l c «Zms (si) (7T, n) 

(A.66) /r>(«t) = [minpt(f D (v' t )),maxpt(f D (v' t ))] 

(A. 67) if i G {1, 2, 3, ... , re} and t, G CONS, then v$ = h cons (st)(Ti) 

(A. 68) if i,j G {1, 2, 3, ... , re}, i < j, n = Tj, and Ti,Tj £ VARS, then Vi = Vj 

(A.Q9) f D (vt)tfD(eval(st,\,g db )) 

I define the mapping g : VARS t— > OBJS as follows: 

<fe/ J fo{vi), if for some i G {1, 2, 3, ... , n}, /? = Tj 



o, otherwise 



where o is a particular element of OBJS, chosen arbitrarily. (A. 56) follows from lemma 
ATS] flOf ), and the definition of g. I set 5 as in ([A?rOD , and show tliat ( |Q7D - ( |Q2| ) 



also hold. ( [A.57 ) follows from lemma A.2| , ( |A.67|) , and the definition of 5. 
(A.70) S = f D (v' t ) 



I now prove ( [A. 59 ), ( A.57| ) (proven above) implies (A. 71). (A. 71) and ( A.65| ) imply 
(A. 72). (A. 72) and the definition of f cu ims of section ^8| imply ( [A.59| ), 



(A.71) 
(A.72) 



fE L (\\ri\\ M{St) ' 9 ) = VI, • • • , fEH\\Tn\r™) = Vn 
(/^(Iklir^),...,/^^^!!^' 9 )) G h culms (st)(7r,n) 



\M(st), g , 



I now prove ( A.60| ). ( A.64] ) and ( |A.70[ ) imply ( |A.73| ). ( |A.63| ) implies that there is at 
least one tuple (v\, . . . ,v n ;v' t ') in h p f uns (st)(7r,n). Then, by ( A.73| ), S contains at least 
the chronons of foW/), and therefore S 7^ 0. (|A.60| ) has been proven. 



(A.73) 



5 



u 



fD(v' t " 



{v 1 ,...,v n ;v' t ") G h p j unB (st)(iT,n) 



I now prove flA||). Using (A.71), ( \k~7$ ) becomes (|A.74| ) . By the definition of f p f un s 
of section pTs| , ( |A.75 ) is equivalent to ( A.76| ). Then, ( A.74| ) can be written as ( |A.77[ ). 
By replacing fD(v' t ") with p, ( A.77|) becomes ( |A.62 ). 



(A.74) 

(A.75) 
(A.76) 
(A.77) 



s= U fM") 

(/ B 1 (||r 1 ||«^). 9 ),...,/- 1 (||r n ||M( S t), 9); „ r) e hpfuns{st )(n,n) 

(fB'iWnW^H ■ ■ ■ , fB\\\Tn\\ M{st) ' 9 ); <) g h pfuns (st)(n,n) 

f D (v' t ") G W^nXllnr^ . . . , ||r n || M ^) 

s= u foiv'n 

/D«')e/^ M ( 7 r,n)(||n|| M ( st )^,..,||r„|r( st ).s) 
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(A. 61) follows from ( A.7Q ) and ( |A.66| ). It remains to prove ( A.58Q . ( A.60| ) and 
( A. 61 ) (both proven above) imply that /d(w*) G PERIODS . From the hypothesis, 
eval(st, X, g db ) G Dp, which implies that fD(eval(st, A, g dh )) is a period or the empty 
set. f£)(eval(st, A, g db )) cannot be the empty set, because according to ( A.69| ) /d(^) 
(which is a period and therefore a non-empty set) is a subset of fjj(eval(st, A, g )). 
Hence, f D (eval{st, X, g db )) G PERIOD S. (|A.69|) and the fact that both / D (u t ) and 
fD{eval(st,X,g db )) are periods imply ( A.58| ). The forward direction of clause 3 has 
been proven. 



I now prove the backwards direction of clause 3. I assume that flAl>6;) - flOp hold. I 
need to prove that (v\, . . . ,v n ;vt) G eval(st,H, g dh ). According to (|A.55[) , it is enough 
to prove that for some v' t and v" , ( A. 63 ) - ( A.69Q hold. ( A. 69 ) follows from ( A. 58 ). 



I now prove (|A.67|) . If i G {1, 2, 3, ... , n} and n G CONS, then the semantics of Top 
implies (A.78), and ( |A.57|) implies (A.79). 

H^ill ^ — f consist) iri) 

\\n\\ M ^ = f D ( Vi ) 



(A.78) 
(A.79) 



(A.78) and (A.79) imply that /d(^) = Icons (st)(ri), which in turn implies (A. 80). 
The definition of f cons of section |5.8| implies (A. 81). (A. 80) and (A. 81) imply that 
Vi = h cons (st)(Ti). ( A.67] ) has been proven. 

(A.80) v i = fn\f cons (st)(T i )) 

(A.81) fcons(st)(ri) = fD(h cons (st)(Ti)) 



I now prove (|A.68| ). If i, j G {1, 2, 3, ... , n} and Tj = tj, then [|tj 



\M(st),g 



\M(st),g 



Then, from ( |A.57|) , /d(u») = /d(«j), which implies that f D l {fD{vi)) = f D 1 {fD{vj)), 



i.e. «j 



( |A.6§| ) has been proven. 



I now prove ( A.65|) . ( |A.59| ) and the definition of f cu ims of section |5]8| imply ( [A. 82 ). 
flAlSTl) implies flA]§ ). ( [Al^D and flAl^ ) imply ( |A^5| ). 



(A.82) 
(A.83) 



(/^(llTlir^^.^/^dlrnir^)) G fc^C^CTT.n) 
^ 1 (lln|| M(st) ' 9 )=^l,...,/D 1 (Kir (si) ' S )=«n 



It remains to prove that ( A. 63; ), ( A.64|) , and ( |A.66|) hold for some v' t ,v". I start with 
(|Al^ ). Acco rding to section |J, f pfuns {st)(7r, n)(||n || M ( S *)>9, . . . , ||r n || M ^) is a set of 
periods. By ( A.62| ), 5 is the union of all these periods. According to (|A.60[) , S is not 
the empty set, which implies that there is at least one period p' such that: 

(A.84) p' G Ujunsist^^nt 1 ^, \\r n \\ M{st) ' g ) 

Let v'l = /£>(?/); which implies that p' = /d(uJ'). Then, from ( |A.84 ) we get: 



(A.85) 



fo^) e f pfuns ( S t)(-K,n)(\\ri\\ M ^, 



\M(st),g 



(|A.85| ) and the definition of f p f uns of section imply ( |A.86[ ). (|A.86|) and ( A. 83; ) imply 

(B )- 



(A.86) 



(/o'dlnir^' 9 ), • • • , /^(Iknir^' 9 ); v'l) G ^( S i)(7r,n) 
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From the discussion above, S is a (non-empty) union of periods. This implies that S 
is a temporal element, which in turn implies that there is a v' t G Dt, such that ( |A,87 ) 
holds. flOD follows from ( [OTP and (|A~87] ). 



(A.87) f D (v f t ) = S 

I now prove flOg ). ( ggg ) and flOTD imply ([O 



(A.« 



/oK) U p 

pe/ p /„ res (st)(7r,n)(||Ti|| A - f ( st )>9,...,||r„||An^),9) 



Tn ||M(st),^ is a get 



According to the discussion above, / p / im s(sO(^, n )(||''~i|| M< ' s ^' 5 \ . . 
of periods. Therefore, for every p in that set, there is a v' t " G Dp, such that foWl') = P 
Hence, ( A. 88 ) can be written as ( A.89| ). 



(A.89) 



fD(v' t ) = U 

/d «') 6 / P / Ims (st)K«)(l|Ti ||M(»*),S, 



|Af(st).9) 



By the definition of f p f un s of section |5.8j , ( A.90|) is equivalent to (|A.91|) . By ( [A. 83 ), 
(A.91) is in turn equivalent to ( A.92| ). ( |A.89 ) and the fact that ([A.90| ) is equivalent to 
( A. 92 ) imply ( A. 64 ). The backwards direction of clause 3 has been proven. 



(A.90) 
(A.91) 
(A.92) 



f D (v' t ") € f pf uns(st)(ir,n)(\\n\\ M ^, ||r n || M ^) 

(f D H\\n\\ Mist) > 9 ), ■ ■ ■ , fE\\\Tn\\ Mist H v't') G h pfuns (st)(7r,n) 
{vi,...,v n ;v' t ") G hpf uns (st)(ir,n) 



A. 3.3 0i A 02 
Translation rule 

If 0i, 02 G YNFORMS and A is a Tsql2 value expression, then: 
trans (0i A 02, A) d = 

(SELECT DISTINCT ai-1, ai-2, ... , a\.ny, a 2 .l , a 2 .2, ■■■ , a 2 .n 2 
VALID VALID (ai) 

FROM trans(4>i, A) AS a,\ , trans((j) 2 , A) AS a 2 
WHERE . . . 
AND ... 

AND ... 

AND VALID(ai) = VALID(a 2 )) 

n\ and 77-2 are the lengths of r 0i n and r 02 n respectively. Each time the translation 
rule is used, a\ and 02 are two new different correlation names, obtained by calling the 
correlation names generator after trans(<j)i, A) and irans(02,A) have been computed. 
Assuming that r 0i n = {t\, . . . , r^) and r 2 n = (rf , . . . , r^ 2 ), the "... "s in the WHERE 
clause are all the strings in S: 



S = {"ai.i = a 2 .f | i G {1,2,3, ... ,ni}, j G {1,2,3, . . . ,n 2 }, 
t\ = r|, and t},t] G VARS} 
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Proof that theorem 15.21 holds for 4> = <j)\ A fa, if it holds for 



and 



I assume that 4>i,(p2 G YNFORMS . By the syntax of Top, this implies that <fii A 
4>2 S YNFORMS . I also assume that st G PTS, A is a Tsql2 value expression, 
g db G G db , eval(st,X,g db ) G D* P , ^ = ( r i, . . . , T ^), r^i = ^ . . . , ^ and s = 
trans(4>iA(j)2, A). From the definition of r . . . n , it should be easy to see that r 4>iA(p2~ 1 = 
: T ni > r i ' • • • ' T n9 ) • Finally, I assume that theorem |5.2| holds for (f> — (f>i and 



i>2- I need to show that: 



1. FCN(E) C FCN(X) 

2. eval(st,T l ,g db ) £ VREL P { ni +n2) 

3. (uj, . . . jW^, vf, . . . , u 2 2 ; Vt) G eval(st, T,,g db ) iff for some g £ G: 

llr i {l M(st), 9 = Mvl)i ... 5 iir^r^)^ = 
ii7fr^=/i?(«?),... , iir n 2 2 ir^ = Mvi 2 ), 

\\4>1 /\(f) 2 \\ M ( st )^fD(vt),fD(eval(stX9 db )),g = 



Let Si and £2 be the embedded SELECT statements in the FROM clause of £ (i.e. Si = 
trans((pi, A) and £2 = trans(cf>2, A)). From the hypothesis, 4>i,(j)2 G YNFORMS , st G 
PTS, ^ = (ri 1 , ...,<), = (ri 2 , . . . ,t4), 17* G and eua Z( S i, X,g db ) G 
Then, from theorem for <^ = <p\ (according to the hypothesis, theorem [T^ holds for 
4> = 4>\ and 4> = 4>2) we get: 

l 1 . FCN(H X ) C FCiV(A) 

2 1 . eval(st, T,i,g db ) G VREL P ( ni ) 

3 . (wj, . . . , ; ft) G eval(st, T> x ,g db ) iff for some (71 G G: 

Wf M *=/ B (4- , lKii M(st) - 91 = /d«), 

||0 i ||M(si), S t,/ D (^)Jl3(e^(^A, 5 , d '')), 9 i = ^ 



and from theorem 5.2 for 



l 2 . FCN{Y> 2 ) C FCW(A) 

2 2 . eval(st,^2,9 db ) G VREL P (n 2 ) 

3 2 . (w 2 , . . . , t> 2 2 ; u t ) G eval(st, T. 2 ,g db ) iff for some 52 G G 
|| r 2||M(^ 92 =/fl(0 2 )j ... , ||r 2 2 r^ =/d«), 
II j, ||M(si), S tJ D (uOJzj(eua/(siA9 df, )),g 2 = J-. 



Proof of clause 1 

The two VALID(ai) in the VALID and WHERE clauses of E, and the VALID(a 2 ) in the WHERE 
clause of E are not free column references in E, because E is a binding context for all of 
them. The ct\.l, . . . , a\.n\, c^.l, • • • , 0:2.712 in the SELECT clause of E, and any column 
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references of the form a\.i or a 2 -j in the WHERE clause of £ (i G {1,2,3, . . . ,ni}, j G 
{1, 2, 3, . . . , 712}; these column references derive from S) are not free column references 
in £ for the same reason. E contains no other column references (and hence no other 
free column references), apart from those that possibly appear within Si or £2. By 
lemma \A.1[ this implies ( |A.93|) . flA,93j ), clause l 1 , and clause l 2 imply clause 1. 

(A.93) FCN(E) C FCW(£i) U FCN(T, 2 ) 



Proof of clause 2 



When computing eval(st,T>, g ), a\ and 02 range over the tuples of eval(st, £1,5 ) 
and ev al(st, £2, 9 db ) respectively. Clauses 2 1 and 2 2 imply that eval(st,T,i, g db ) and 
eval(st,Ti2, g db ) are valid-time relations of n\ and n 2 explicit attributes respectively. 
Hence, ot\ ranges over tuples (v{, . . . , v\ \ v\) G eval(st,T>i, g db ), and a 2 ranges over 
tuples (vi, ...,v% 2 ;vt) G eval(st, H 2 ,g db ). 

ot\ is generated after computing £2 = trans((f>2, A) (see the translation rule). Hence, a± 
cannot appear in £2- Since a% does not appear in £2, for every tuple (v{, . . . ,v^;vj): 

(A.94) e W a/( S t,£ 2 ,(/ 6 )^ t vln . vl) ) = eval(st,X 2 ,g db ) 



It should now be easy to see from the translation rule that ( [A. 95 ) holds, (uj, . . . , ; t>t) 
is the tuple of eval{st,T,i,g dh ) that corresponds to ai, while (vf, . . . ,v^ 2 ;vt) is the 
tuple of eval(st,T,2, g db ) that corresponds to «2- The last constraint in the WHERE 
clause of £ requires the time-stamps of the two tuples to be identical. The last con- 
straint in QA,95| ) corresponds to the constraints in the WHERE clause of £ that de- 



rive from S (see also the comments about the equality predicate in section |5.3.6|) . 



I should have used ev al(st, £2, (9 db )^\ 1 J instead of evaUst, £2, g db ), to cap- 

(v^ ,...,v ni ;vt) 

ture the fact that if there is any free column reference of a\ in £2, this has to be 
taken to refer to the (v{ , . . . , v^; vt) tuple to which a\ refers. By ( A.94 ), however, 
eval(st, £2, {g db )'^ J i v i , Vt j) is the same as eval(st, £2, g db )- 

(A.95) eval(st,Z,g db ) = {(vl...,vl 1 ,vl...,vl 2 ;v t ) | 
(v\, ...jV^Vt) G eval(st,J: 1 ,g db ), 
(vj, ...,vl 2 ;v t ) G eval(st,T l2 ,g db ), and 

if i G {1,2,3, . . . ,m}, j G {1,2,3, ... ,n 2 }, r/,r? G VARS, and r\ = if, 
then v} = Vj} 



According to ( A.95| ), in every tuple of eval(st,T,, g db ), the time-stamp vt is also the 



time-stamp of a tuple in eval(st, £1, g ). By clause l 1 , eval(st, T>i,g db ) G VRELp(ni) 
which implies that vt G Dp. Hence, all the time-stamps of eval(st, T>,g db ) are elements 
of Dp. ( A.95] ) also implies that eval(st, £, g db ) is a valid-time relation of ni+n 2 explicit 



attributes. Hence, eval(st,Y>, g ) G VRELp(ni + n 2 ). Clause 2 has been proven. 
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Proof of clause 3 



Using the definition of \\4>i A 2 \\ M ( st )> st > et > lt >9 (section clause 3 becomes: 

{v\, . . . , , v±, . . . , v\ 2 \ vt) G eval(st, E, g db ) iff for some g : 
(A.96) g G G 

(A.97) \\rl\\ M ^ = f D (v{), iKir^^ = f D (vl) 

(A.98) \tf\\ M ™* = f D (vl), \\rl 2 \\ M ^ = f D {vl 2 ) 

(A. 99) \\0 1 \\ M (st),st,f D (vt),fn(eval{st,\,g db )),g _ rp 

(A. 100) \\ ( j )2 \\M{st),st,f D (v t )J D {eval(st,\,g db )) ) g _ j, 



I first prove the forward direction of clause 3. I assume that ( A.101| ) holds. I need to 
prove that for some g, ( |A.96| ) - ( A.100| ) also hold. 



(A.101) 



(v\, . . .,v l ni ,vl, . . . ,vl 2 ;v t ) G eval(st,^,g db ) 



(|A.101|) and (|A~95| ) imply that: 

(A.102) (v\,...,v l ni ;v t ) G eval(st^ l)g db ) 
(A.103) (vl...,v 2 n2 ;v t ) G eval(st,Z 2 ,g db ) 
(A.104) ifi€{l,2,3,...,ni}, j G {1,2,3, 



, n 2}, T i> T j £ VARS, and r i = if, 



then v. 



( |A.102| ) and clause 3 1 imply that for some g±: 



(A. 105) 
(A.106) 

(A.107) 



9i € G 



l\\M(st),gi 



f D {v\) 



|| 1 ||Af(st),gi 

J 1 1 711 II 



\M(st),st,f D (v t ),f D (eval{st,\,g db )),gi 



n\\ - " — - - ■ ~ =T 
Similarly, ( A.103| ) and clause 3 2 imply that for some g 2 : 

(A.108) 
(A.109) 

(A.110) 



92 G G 

\\rt\\ M ^ = fn(vl), . . . , \\rl 2 \\ M ^ = / «) 

H i )a ||Af(st),st,/ D (i.0,/i3(ei;aZ( S t,A, 5 , tit ')), 9 2 = T 



I define the mapping g : VARS i— ► OBJS as follows: 



£fe/ 



gi(/3), if for some i G {1, 2, 3, ... , ni}, /3 = r/ 
32 (/?), if for some j G {1, 2, 3, ... , n 2 }, /? = r| 
o, otherwise 



where o is a particular element of OBJS, chosen arbitrarily. ( A.96 ) follows from lemma 
A6[ the definition of g, ( |A.104|) , (|A.106|) , and (|A.109|) . 



The definition of g implies that g and gi assign the same values to all the variables 
among t\,. . . , . The assumption that r (pi~ 1 = (t\, . . . , r„ ) and the definition of r 0i n 
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imply that all the variables of ft are among t\, . . . , 7~ . Hence, g and g± assign the 
same values to all the variables in (f)\. ( A.97| ) follows from lemma A, 3 , the assumption 
that (ti, . . . ,r„ 1 ) = r ft~ l , ( A.106| ), and the fact that g and g\ assign the same values 
to all the variables of ft. The proof of ( A. 98 ) is very similar. 



I now prove ( 
of ft implies 

similar. This concludes the proof of the forward direction of clause 3. 



i.99|). The fact that g and g\ assign the same values to all the variables 
rTlTI). ( |A~TTl|) and (|A~10^ ) imply ( |A~99|) . The proof of ( |A~Too|) is very 



(A.lll) 



\M(st),st,f D (vt),f D (eval(st,X,g db )),g 



\M(st),st,f D (vt),f D (eval(st,\,g db )),g 1 



I now prove the backwards direction of clause 3. I assume that for some g, (A.96) 
- flA.lOCD hold. I need to prove that (v\, . . . , v^, vf , . . . , v^ 2 ; ify) G eval(st,T., g db ). 
According to flOgD , it is enough to prove flOO^ ) - flOO^) . Clause 3 1 , (|A^D , fl07D , 
and imply (|A.102|) . Clause 3 2 , JOED , flOgp , and flO0C| ) imply (|AT03D . 



It remains to prove ( A.104]) . Let us assume that i G {1, 2, 3, ... , ni}, j G {1, 2, 3, 
r^rj 6 and r/ = rf. (|AT97D and (^9§ imply that / D l 

in turn implies that f^ifoiv})) = f^ifoivj)), i.e. that 
backwards direction of clause 3 have been proven. 



v} 



Id{v 



■ ,n 2 }, 
This 



( |A.104| ) and the 



A. 3.4 Pres[(f)'} 
Translation rule 

If (f)' G YNFORMS and A is a Tsql2 value expression, then: 
trans(Pres[(f>'], X) = 

(SELECT DISTINCT a.l, a. 2, ... , a.n 
VALID VALID (a) 
FROM transit', A) AS a 

WHERE VALID (a) CONTAINS TIMESTAMP 'now') 

n is the length of r (f)'~ l . Each time the translation rule is used, a is a new correlation 
name, obtained by calling the correlation names generator. 



Proof that theorem 5.2 holds for cj) = Pres[(p'}, if it holds for 



I assume that ft G YNFORMS. By the syntax of Top, this implies that Pres[ft] G 
YNFORMS. I also assume that st G PTS, A is a Tsql2 value expression, g db G G db , 
eval(st, A, g db ) G Dp, r (j) n = (t%, T3, . . . , r n ), and S = trans(Pres[ft], A). From 
the definition of r . . . n , it should be easy to see that r Pres[ft]~ 1 = r (p n = (t\, . . . ,r n ). 



Finally, I assume that theorem 5.2 holds for (j) = ft . I need to show that: 



1. FCN{Y>) C FCN(X) 

2. eval(st,Z,g db ) G VREL P (n) 
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3. (vi,..., v n ; vt) G eval(st, £, iff for some g G G: 
|| ri ||M(^), 9 = ^ ll^ll^^ = /dK), and 

||p res U'l||M(st),st,/ D (t) t ),/i3(ef^(st,A,5i df, )),g _ y 

Let £' be the embedded SELECT statement in the FROM clause of S, i.e. £' = trans(<j>', A). 
From the hypothesis, 0' G YNFORMS, st G PTS, r c£ n = (r x , . . . , r n ), g db G G d6 , and 
eval(st, A, G -Dp. Then, from theorem 52 for ^ = <// (according to the hypothesis, 
theorem |5.2| holds for (j> = 0') we get: 

1'. FCN(E') C FCN(X) 

2'. eval(st,Z',g db ) G VREL P (n) 

3'. (vi, . . . , w n ; ft) G eval(st, T,',g db ) iff for some g G G: 
|| ri ||M( S t), 9 = /d(ui); . . . ; ||r n || M ^)^ = / D K), and 

^/^M(st),stJ D {vt)J D (eval(st,X,g db )),g = j, 



Proof of clause 1 

The two VALID (a) in the VALID and WHERE clauses of £ are not free column references 
in E, because £ is a binding context for both of them. The a.l, . . . , a.n in the SELECT 
clause of £ are not free column references in £ for the same reason. £ contains no 
other column references (and hence no other free column references), apart from those 
that possibly appear within £'. By lemma A.l , this implies ([A.112j ). ( |A.112| ) and 
clause 1' imply clause 1. 

(A.112) FCN(E) C FCN(E') 

Proof of clause 2 

Given that eval(st, £', g ) G VRELp(n) (from clause 2'), it should be easy to see from 
the translation rule that: 

(A.113) eval(st,Z,g db ) = {(v u . . . ,v n ;v t ) G eval(st, £', g db ) \ st G f D {v t )} 



By ( A.113 ), any time-stamp vt in eval(st, £, ) is also a time-stamp in eval(st, £', 5 ). 
This implies that ^ G -Dp, because eval(st,T>' , g db ) G VRELp(n). ( |A.113j ) also im- 
plies that eval(st,T,, g db ) is a valid-time relation of n explicit attributes. Therefore, 
eval(st,T,,g db ) G VREL P (n). Clause 2 has been proven. 

Proof of clause 3 

Using the definition of || Pres [0'] || M ( s *)> s *: ei >'*,9 (section |3?7|) , clause 3 becomes: 

(A. 114) (vi, . . . ,v n ;v t ) G eval(st,Y,,g db ) iff for some g £ G : 

\\ri\\ M ^ 9 = Mvt), ||r n || M ^ = / D K), 

^ G /D(«t), and \\^\\M(st),stjD(vt),f D (eval(st,X,g^)),g = rp 
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I first prove the forward direction of (A. 114). If (vi, . . . , v n ;vt) £ eval(st, S, g ), then 
according to ( [A. 113 ): 



(A.115) 
(A.116) 



(vi, . . .,v n ;v t ) £ eval(st,T,',g db ) 
st £ f D (v t ) 



( [A.115[ ), clause 3', and ( A.116 ) imply that for some g £ G: 



(A.117) 



|M(st) l9 



|M(st),g 



fD(v n ) 



st £ fn{vt), and 



/ /||M(st),si,/ D (ut)J r) (emZ(s^A, S d6 )), S 



T 



The forward direction of ( |A.114| ) has been proven. I now prove the backwards di- 
rection of ( A.114 ). If for some g £ G, ( A.117 ) holds, then according to clause 3', 
( [AlT8|) is true. Also ( |AlT7| ) implies ([AT[|). (|ATr8|) , ( |AlT9| ), and ( [ATlD imply 
that . . . , v n ; vt) £ eval(st,Y>, g db ). The backwards direction of ( |A.114j ) has been 
proven. 



(A.118) 
(A.119) 



(vi, . . .,v n ;v t ) £ eval(st,T,',g db ) 
st £ fD{v t ) 



A. 3.5 Past\J3,<j/\ 
Translation rule 

If P £ VARS, (f)' £ YNFORMS, and A is a Tsql2 value expression, then: 
trans(Past[p, <//], A) d = 

(SELECT DISTINCT VALID (a) , a.l, a. 2, ... , a.n 
VALID VALID (a) 
FROM trans(<f/ , A') AS a) 

A' is the expression INTERSECT (A, PERIOD (TIMESTAMP 'beginning', TIMESTAMP 'now' - 
INTERVAL ' 1 ' x) ) , X is the TSQL2 name of the granularity of chronons (e.g. DAY if 
chronons correspond to days), and n is the length of r 0' n . Each time the translation 
rule is used, a is a new correlation name, obtained by calling the correlation names 
generator. 



Proof that theorem ^2 holds for <fi = Past[P,(f>'], if it holds for 



I assume that P £ VARS and dp' £ YNFORMS. By the syntax of Top, this implies 
that Past[P,(p'} £ YNFORMS. I also assume that st £ PTS, A is a Tsql2 value 
expression, A' is as in the translation rule, g db £ G db , eval(st, A, g db ) £ Dp, r (f>'~ 1 = 
{t~i,T2,T3, . . . , r n ), and S = trans(Past[P, <f>'], A). From the definition of r . . . n , it should 



be easy to see that r Past[P, c//]" 1 = (/?, n, . . . , T n ). Finally, I assume that theorem 5.2 
holds for 6 = 6' . I need to show that: 



1. FCN(E) C FCN(X) 



APPENDIX A. TRANSLATION RULES AND PROOFS FOR CHAPTER ?? 340 



2. eval{st,T l ,g db ) G VREL P (n + 1) 

3. («,«!,..., t> n ; f t ) G eval(st, S, g rfb ) iff for some g £ G: 

P||M( S t), 9 = || ri ||M( st ), g = ; || Tn ||Af(-t),p = /D ( Un ) ) and 

|| Past (j)']\\ M ( st )' st 'fD{vt)jD(eval(st,X,g db )),g _ y 



The definition of A' implies that any free column reference in A' is situated within the 
A of A'. By lemma AA, this implies ( A.120 ). Also, the syntax of Tsql2 and the fact 
that A is a value expression imply that A' is a value expression as well. 



(A.120) 



FCN(X') C FCN(X) 



The assumption that eval(st, A, g db ) G Dp and the definition of A' imply ( A.121| ) and 
that eval{st, A' , g db ) G D* P . 



(A.121) 



f D (eval(st, X',g db )) = f D (eval(st, A, g db )) n [*/«•«*,«*) 



Let £' be the embedded SELECT statement in the FROM clause of S, i.e. £' = trans(cj)', A'). 
From the hypothesis, (/>' G YNFORMS, st G PTS, r n = (n, . . . , r n ), and g db G G*. 
From the discussion above, A' is a value expression, and eval(st, X' , g db ) G L>J>. Then, 
from theorem 5.2 for <fi = <fi' (according to the hypothesis, theorem holds for <p = (/)'): 



1'. FCTV(S') C FCN{X') 

2'. eval(st,Z',g db ) G VREL P (n) 

3'. (vi,..., w ra ; ^t) G eval(st, <7 d6 ) iff for some </ G G: 

|| Tl ||M( s t), ff ' = /d(ui); _ _ ; || rn ||M (s t)y = /D(Vn)) and 

||0/||A/(st), S t,/ D (i.O,/ D (em«(st,A',g db )),g' _ T 



Proof of clause 1 



The two VALID (a) in the SELECT and VALID clauses of E are not free column references 
in S, because S is a binding context for both of them. The a.l, . . . , a.n in the SELECT 
clause of £ are not free column references in £ for the same reason. £ contains no 
other column references (and hence no other free column references) , apart from those 
that possibly appear within £'. By lemma A.l, this implies ( |A.122| ). (A. 122), clause 
1', and ( A.120 ) imply clause 1. 



(A.122) 



FCN(E) C FCN(E') 



Proof of clause 2 

According to clause 2', eval(st,T, f , g ) G VRELp(n). Then, from the translation rule, 
it should be easy to see that flA~123|) holds. 

(A.123) eval(st,X,g db ) = {(v t , v u ...,v n ; v t ) \ 

(vi, . . .,v n ;v t ) G eval(st,T,',g db )} 
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( [A. 123 ) implies that all the time-stamps vt of eval(st,Ti, g db ) are also time-stamps of 
eval(st,T l ',g db ). This implies that v t £ D P , because eval(st,T,' , g db ) £ VREL P (n). 
(A. 123) also implies that eval(st,Yl, g db ) is a valid-time relation of n + 1 explicit at- 
tributes. Therefore, eval(st,Y,, g db ) £ VRELp{n + 1). Clause 2 has been proven. 



Proof of clause 3 



Using the definition of \\Past[P,4>']\\ st,et ' lt ' 9 (section |3~8[) and the fact that 
g((3), clause 3 becomes: 



M{st),g 



(A.124) 
(A.125) 
(A.126) 
(A.127) 
(A.128) 



(v , v\, . . . , v n ; vt) £ eval(st, S, g db ) iff for some g : 
g{(3) = f D {v) 

|| ri ||M( st ),, = fn{vil _ \\ Tn \\ M ^ = f D (v n ) 
g(P) = f D (v t ) 

U ( j ) l\\stJ D (v t )J D (eval(stXg db ))^[tfirstM),g — J> 



I first prove the forward direction of clause 3. I assume that (v, v\, . . . , v n ; Vt) £ 
eval(st,T 1 ,g db ). I need to prove that for some g, ( |A.124|) - ( |A.128| ) hold. The as- 



sumption that (v,v\, . . . , v n ;vt) £ eval(st, S, g db ) and ( A.123| ) imply that: 
(A. 129) v = v t 

(A.130) ( Vl ,...,v n ;v t ) £ eval{st,Z',g db ) 



(A. 130) and clause 3' imply that for some g': 



(A.131) 
(A.132) 
(A.133) 



g'eG 



\M(st),g> 



fo(vi), . . • , \\r n 



\M(st),g' 



fD(v n ) 



l) /uM(st),stJ D (v t ),f D (eval(st,X',g db )),g' = j, 



Let g = (9')f D f Vt y Clause 2 (proven above) and the assumption that (v, v\, . . . , v n ; vt) £ 
eval(st, E, g db ) imply that vt £ Dp, a fact that along with lemma |A.7|, th e assumption 
that (3 £ VARS, (|ATl3ll ), and the definition of g imply ([AT24]). (|A.127| ) follows from 
the definition of g. ( |A.125| ) follows from (jAT27|) and ( [A329D . 



(|A.133|) and (|A.121|) imply (|A.134|) 

(A. 134) \\(p'\\M(st),stJ D (vt)jD(eval(st,\,g db ))n{t first ,st),g' 



The syntax of Top (section [T2|) and the fact that Past[(3,(j)'} £ YNFORMS, imply 
that (3 does not occur in <f>'. This and the definition of g imply ( A.135| ). ( [A. 134 ) and 
([Al35D imply ( |A~128|) . 



(A.135) 



f) t^M(st) y stJ D (vt),fD(eval{st,\,g db ))n[t flrat ,st),g> = 

^^M(st),stJ D (v t ),f D (eval(st,X,g db ))n[t f „. st ,st),(g'f fD(vt) 

h ruM(st),st,f D (v t )J D (eval(st,X,g db ))n[t f „. st ,st),g 
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(A. 126) follows from lemma A. 3, the assumption that r do'~ l = (n, . . . ,T n ), (A. 132), and 
the fact that g and g' assign the same values to all variables, possibly apart from 0, 
which does not occur in do'. The forward direction of clause 3 has been proven. 



I now prove the backwards direction of clause 3. I assume that (|Al24D - dAl2£| ) 
hold. I need to prove that (v, v\, . . . , v n ; vt) £ eval(st,T,, g db ). According to ([A.123j ), 
it is enough to prove ( A.129| ) and (|A.13C| ). According to clause 3', in order to prove 
([A. 130 1), it is enough to prove flOgg ) - flQHj ). (03|)and ( |A.137| ) are the sam e as 
(|A.124|) and ( |A.126j) , which were assumed to hold. (|A.138|) follows from ( |A.128|) and 



(|Al2l|) 



(A.136) 
(A.137) 
(A.138) 



9 £ G 

\M(st),g 



n||™ = f D (vx), . . . , ||r n || M ^ = f D (v Tl 

fuM(st),stJ D (v t ),f D (eval(st,X',g db )),g _ T 



It remains to prove dA.129| ). From ( |A.125| ) and ( [A. 127] ) we get /o(«t) = fo(v), which 
implies that /^ 1 (/d(^)) = /^ 1 (/d(^)) ) i-e. v = vt- Hence, (|A,129| ) holds. The back- 
wards direction of clause 3 has been proven. 



A. 3.6 Perf[(3,(f)'} 
Translation rule 

If £ VARS, 4>' £ YNFORMS, and A is a Tsql2 value expression, then: 
trans(Perf[j3, <fi'], A) d = f 

(SELECT DISTINCT VALID (a) , a.l, a. 2, ... , a.n 
VALID INTERSECT (A, PERIOD (END (VALID (a) ) + INTERVAL '1' x> 

TIMESTAMP 'forever')) 

FROM trans(4>' ,\ imt ) AS a 
) (SUBPERIOD) 



Xinit is as m section 5.10, \ is t ne TSQL2 name of the granularity of chronons, and n is 
the length of r <j)'~ l . Each time the translation rule is used, a is a new correlation name, 
obtained by calling the correlation names generator after A has been supplied. 



Proof that theorem 5.2 holds for (f> = Perf [(3, <p'], if it holds for 



I assume that £ VARS and dp' £ YNFORMS. By the syntax of Top, this implies that 
Perf[0, do'] £ YNFORMS . I also assume that st £ PTS, A is a Tsql2 value expression, 
Xinit is as in section g db £ G db , eval(st, A, g db ) £ D* p , r 0' n = (n, r 2 , r 3 , . . . , r n ), 
and X = trans(Perf[0,4>'], X). From the definition of r . . . n , it should be easy to see 
that r Perf[0, do']" 1 = (0, n, . . . , r ra ). Finally, I assume that theorem 5^2 holds for do = do' . 
I need to show that: 



1. FCN(E) C FCN(X) 
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2. eval{st,T l ,g db ) G VREL P (n + l) 

3. (v, vi, ... , f n ; vt) G eval(st, £, 5*) iff for some g 6 G: 
||/3|| M(si) ' 9 = /oW, ||ri|| M ( st )^ =«i,..., ||r n || M (^ = v n , and 
||Per/[/3, ^'Jll^C^.^./o^OJuCe^Cs*^./ 6 ))^ = t 

According to the syntax of Tsql2, \i n it is a value expression. By lemma |5T| , (A. 139) 
- (A.141) hold. 

(A.139) f D (eval(st,X tmt ,g db )) = PTS 

(A.140) eval(st,X mit ,g db )eD* P 
(A.141) FCN(X inU ) = 

Let E' be the SELECT statement in the FROM clause of E, i.e. £' = trans((f>', Xmit)- From 
the hypothesis, <p' G YNFORMS, st G PTS, r c// n = (n, . . . ,r„), and 5* G G db . From 
the discussion above, Xmit is a Tsql2 value expression, and eval(st, Xmu, g db ) G -Dp. 
Then, from theorem |5.2| for (j) = <f>' (according to the hypothesis, theorem |5.2| holds for 
(j) = 4>'), and using (A.139) and (A.141), we get: 

1'. FCN(E') C 

2'. eval(st, X/, g db ) G VRELp(n) 

3'. . . . , v n ; v' t ) G eval(st, S', <7 d6 ) iff for some 5' G G: 
\\ Tl \\M(stl 9 ' = fD(vi)} ; || rn ||^)^' = /dK ), a nd 



Proof of clause 1 

The two VALID (a) in the SELECT and VALID clauses of E are not free column references 
in E, because E is a binding context for both of them. The a.l, . . . , a.n in the SELECT 
clause of E are not free column references in E for the same reason. E contains no 
other column references (and hence no other free column references), apart from those 



that possibly appear within E' or the A in the VALID clause of E. By lemma A.l , this 
implies (|A.142| ). Clause 1' and dA.142|) imply clause 1. 



(A.142) FCN(£) C PCiV(E / ) U FCN(X) 



Proof of clause 2 

When computing eval(st, E, g db ), the a of E ranges over the tuples of eval(st, £', g db ). 
By clause 2', this implies that a ranges over tuples of the form (vi, . . . ,v n ;v' t ), where 
v' t G Dp. a is generated by calling the correlation names generator after A has been 
supplied. Hence, a cannot appear in A. The fact that a does not appear in A means 
that for every tuple (v±, . . . , v n ; v' t ): 

(A.143) eval(st, A, (s*)^,...,^)) = eval(st, A, g db ) 
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The reader should now be able to see that ( A.144| ) holds. Intuitively, (v\, . . . , 



is the tuple to which a refers. For each a-tuple {v\, . . . ,v n ;v' t ), the SELECT and VALID 
clauses of E generate a new tuple (v , v%, . . . , v n ; v'{), where v = v' t (the old time-stamp), 
and v'{ (the new time-stamp) represents the intersection of eval(st, A, g db ) with the 
period (or empty set) that begins immediately after the end of /d(^) and ends at 
the end of the time- axis. The restriction /d(u") ^ is needed to capture the fact 
that when evaluating E, new tuples of the form (v,v\,. . . ,v n ;v' t ') are automatically 
removed from the resulting relation if foWl) = 0- (The time-stamps of valid-time 
relations must represent temporal elements, which are non-empty sets of chronons; see 
section |5.2.3| .) 

(A. 144) eval(st,T,, g ) = subperiod({{v , v\ , . . . ,v n ;v[ f ) | for some v' t , 

(v 1 , . . .,v n ;v' t ) £ eval(st,J:',g db ), 
/d«) = f D {eval{st, A, g db )) n {maxpt(f D (v' t )), i {ost ], 
/oKO + and v = v 't}) 

In (|A.144|) , I should have used eval(st, A, (g db )? Vl Vn - V >\) instead of eval(st, X,g db ), to 
capture the fact that if there is any free column reference of a in A, this has to be 
taken to refer to the (v±, . . . ,v n ;v' t ) tuple to which a refers. By ( A.143| ), however, 
eval(st, A, (g db )" Vl v . v ij) is the same as eval(st, A, g db ). 



Using the definition of subperiod (section |5.3.2| ), ( A. 144 ) becomes ( A. 145 ). (The 



/z>(^t) E JdWI) of ( A.145| ) implies that foWt) is a period, i.e. a non-empty set. 



Hence, the constraint /d(v") ^ of ( |A.144| ) is not needed in QA.145| ).) 

(A. 145) ev al(st, E, g db ) = {(v, v\, . . . , v n ; vt) | for some v' t , v' t ' , 

(vi, . . .,v n ;v' t ) £ eval(st,T,',g db ), 
/dKO = f D (eval{st, X, g db )) n (maxpt{f D {v' t )),t last ], 
v = v' t , and f D {v t ) E /dK')} 



([A.145|) is equivalent to (|A.146| ). 



(A. 146) eval(st, E, g db ) = {{v, v±, . . . , v n ; vt) | for some v' t , 

(v\, . . . , v n ; v' t ) & eval(st, E', g ), v = v' t , and 
/o(«t) E f D {eval(st,\,g db )) n (maxpt(f D (v' t )),ti ast ]} 

For every tuple (v\, . . . , v n ; v t ) £ eval(st,T,, g db ), the /£>(v t ) □ fn(eval(st, X, g db )) n 
(maxpt(f£,(v' t )),ti ast \ in ( A.146j ) implies that fo{vt) is a period, which in turn implies 



that G Dp. ( [A, 146 ) also implies that eval(st, S, is a valid-time relation of n + 1 
explicit attributes. Hence, eval(st,T>, g db ) £ VRELp(n + l). Clause 2 has been proven. 
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Proof of clause 3 



Using the definition of \\Perf[(3,<f>']\\ M( - s V' st > et > lt > a (see section gjj), and the fact that 
\\P\\M(>t)d = g (f3), clause 3 becomes: 



(A.147) 
(A.148) 
(A.149) 
(A.150) 
(A.151) 
(A.152) 
(A.153) 
(A.154) 



(v,v\, . . . , v n ; vt) € eval(st, S, g db ) iff for some g and et' : 
et' e PERIODS 



\M{st),g 



/d(«i), 



|M(st),g 



fD(v n ) 



g((3) = et' 

fD(vt)QfD(eval(st,\,g db )) 

maxpt(et) -< minpt{f £>{vt)) 
ni/nM(st),st,ef,PTS,g = j, 



I first prove the forward direction of clause 3. I assume that (v,v±, . . . ,v n ;vt) £ 
eva^st,^, g db ). I need to prove that for some g and et' , (|A.147| ) - ( A.154 ) hold. 
The assumption that (v, v%, . . . , v n ; Vt) € eval(st,T>, g db ), and ( |A.146| ) imply that for 
some v' t : 

(A.155) 
(A.156) 
(A.157) 



{vi, . . . ,v n ;v' t ) e eval(st, H',g ) 

fD{v t ) E f D (eval(st,X,g db )) n (maxpt(f D (v' t )),t last \ 

v = vt 



I set et' as in ( |A.158| ) and show that ( |A.147| ) - ( |A.154| ) hold. 
(A. 158) et' = f D {y' t ) 

(|A.155[) and clause 3' imply that for some g': 



(A.159) 
(A.160) 
(A.161) 



\\ n \\ M ^ =f D (v 1 ),...,\\T n 

»±l»M{st), S tjDW t ),PTS,g' = T 



\M(st),g' 



Let g = (g'f fD ( v >y ( |A.155| ) and clause 2' imply that v' t £ D P . By lemma |A~7|, ( [A. 147] ) 
holds. ( |A.15l| ) follows from the definition of g and (|A.158| ). The conclusion that 
v[ e Dp implies th at f D {v' t ) G PERIO DS. This and ( |A~T5^ ) imply (|Q4^) . The 
definition of g and ( |A.157|) imply ( |A.149| ). 



I now prove (A.154). The syntax of Top (section |3.2| ) and the fact that Perf[f3, eft'] 6 
YNFORMS , imply that (3 does not occur in 4>' . Along with the definition of g, this 
implies (|A~1d2| ). (|Al6l"l) , ([AT5|), and ( |A~16^ ) imply ( |Al5^ ). 



(A.162) 



fJuM(st),st,et' ,PTS,g' _ 

M{st),st,et',PTS,{g'f fD{v , t) 
,/uM(st),st,et' ,PT8,g 
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(A. 150) follows from lemma A. 3, the assumption that r (p'~ l = (n, . . . , r n ), (A. 160), and 
the fact that g and g' assign the same values to all variables, possibly apart from /3, 
which does not occur in 6'. 



I now prove (|A.152|) . ( |Al56|) and the definition of C imply (|A.163|) and flAlol ). 
(IA.164Q in turn implies ( |A.165| ). 



(A.163) 
(A.164) 
(A.165) 



f D {v t ) G PERIODS 

fD{v t ) C f D (eval(st, X, g db )) n {maxpt(f D (v' t )), t last ] 
f D (v t )Qf D (eval(st,\,g db )) 



From the hypothesis, eval(st, A, G Dp, which implies that f£>(eval(st, A, g d6 )) is the 
empty set or a period. fi)(eval(st, X,g db )) cannot be the empty set, because according 
to ( A.165| ), it has foipt) as its subset, and by ( A.163; ) /o(ut) is a period, i.e. a non- 
empty set. Hence, (|A1d6|) holds. flOjD, ( |A.163|) , and (|A1d6|) imply ( |A~15l) . 



(A.166) 



f D (eval(st, X, g db )) G PERIODS 



It remains to prove ( |A.153| ). ( |A.164| ) implies ( |A.167| ). (maxpt(f£)(v' t )),ti as t} is either a 
period or the empty set (if maxpt(fD(v' t )) = ti as t). It cannot be the empty set, because 
according to ( A.167 ), (maxpt(fi)(v' t )),ti as t] has /i)(ut) as its subset, and by 
/d(^) is a period, i.e. a non-empty set. Hence, ( A.168| ) holds. ( A.167 ), 




and (|AT6|) imply ( |A.169| ). ( |Al5^ ) follows from flP6SD and ( |A.158| ). The forward 
direction of clause 3 has been proven. 



(A.167) 
(A.168) 
(A.169) 



/d(v*) Q (maxpt(f D (v' t )), ti as t] 
{maxpt{f D {v' t )),t last \ G PERIODS 

ast\ 



I now prove the backwards direction of clause 3. I assume that (|047|) - (|A~T54|) hold. 
I need to prove that (v,v\, . . . ,v n ;vt) G eval(st,T>, g db ). According to ( A.146| ), it is 
enough to prove that for some v' t , ( |A.155|) - (IA.157D hold. I set v[ to v as required 
by ( A. 157 ), and prove ( |A.155 ) and ( |A.156| ). According to clause 3', in order to prove 
( |A.155[ ), it is enough to prove that: 



(A.170) 
(A.171) 
(A.172) 



56G 

\\ii\\M(st),stJ D (v' t ),PTS,g 



T 



\M(st),g 



dA370|) and ( |Al7l|) are the same as ( IA.147Q and (|Al50| ), which were assumed to hold. 
( Al57 ) (that hol ds), (|A.149|) and ( |Al5l| ) imply (IA.173Q . (|Al7^ ) and (|A.154|) imply 
(lAlml) . Hence, QA.155Q holds. 



(A. 173) f D (v' t ) = et' 

It remains to prove ( |A.156| ). dA.152|) implies (A. 174) and (A. 175). 

(A. 174) f D (v t ) G PERIODS 

(A.175) f D (eval(st,X,g db )) G PERIODS 
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(A.174), ( |A.14£| ), and p5^ ) imply (A.176) and (A.177). ( |A~T52D implies (A.178) 



(A.177) and (A.178) imply (A.179) 

(A.176) (maxpt(et'),ti ast ) G PERIODS 

(A.177) /dWc/^oIKA/)) 
(A.178) / D (u t ) C (maxpt(et'),t last ] 

(A.179) /d(^) C f D (eval(st, X,g db )) D (maxpt(et') J t lMt ] 

According to (A. 175) and (A.176), fr>(eval(st, A, g d6 )) D (maxpt(et') , ti as t] is the inter- 
section of two periods. The intersection of two periods is the empty set if the periods 
do not overlap, or a period (the overlap of the two periods) if the periods overlap. 
fD(eval(st, A, g dh )) and (maxpt(et'),ti ast \ do overlap, because according to (A.179), 
their intersection has /o(i>t) as its subset, and by (A.174) /d(i^) is a period, i.e. a non- 
empty set. Hence, fu(eval(st, A, g dh )) n (maxpt(et') , ti ast ] is a period. This conclusion, 
(A.174), (A.179), and QA.173 ) imply ( |A.156j ). The backwards direction of clause 3 has 



been proven. 



A. 3.7 Ntense[(3, <p'} 
Translation rule 

If [3 G VARS, 4>' G YNFORMS, and A is a Tsql2 value expression, then: 

def 

trans(Ntense[P,<t>'],\) = 

(SELECT DISTINCT VALID (a) , a.l, a. 2, ... , a.n 
VALID PERIOD (TIMESTAMP 'beginning', TIMESTAMP 'forever') 
FROM trans(4>' ,\ imt ) AS a 

) (SUBPERIOD) 



Xinit is as in section 5.10| , and n is the length of r 0' n . Each time the translation rule is 



used, a is a new correlation name, obtained by calling the correlation names generator. 



Proof that theorem 5.2 holds for <fi = Ntense[f3,(j/], if it holds for 



The proof is very similar to that of section A.3.6 



A . 3 . 8 N tense [now * , <p'] 
Translation rule 

If 4>' G YNFORMS and A is a Tsql2 value expression, then: 

def 

trans(Ntense[now*,4>'], X) = 

(SELECT DISTINCT a.l, a.l, ... , a.n 

VALID PERIOD (TIMESTAMP 'beginning', TIMESTAMP 'forever') 
FROM trans(cj)' , X imt ) AS a 

WHERE VALID (a) = PERIOD (TIMESTAMP 'now', TIMESTAMP 'now') 
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) (SUBPERIOD) 



Xinit is as in section 5.10| , and n is the length of r cf)' n . Each time the translation rule is 



used, a is a new correlation name, obtained by calling the correlation names generator. 



Proof that theorem 5.2 holds for cj) = Ntense[now* , </>'], if it holds for 



The proof is very similar to that of section A. 3. 6 



A. 3. 9 For[a c ,u qty ,(p'] 
Translation rule 

If a c G CPARTS, v qty G {1,2,3,...}, </>' G YNFORMS, and A is a Tsql2 value 
expression, then: 

def 

trans(For[a c ,i/ qty ,(f)'},\) = 

(SELECT DISTINCT a.l, a. 2, ... , a.n 

VALID VALID (a) 

FROM transit', A) AS a 

WHERE INTERVAL (VALID (a) , 7) = INTERVAL 'v qty ' 7) 

n is the length of r (p n , and 7 is the first element of the pair h' cparts {a c ) = (7, E c ) 
(section |5.9[ ). Each time the translation rule is used, a is a new correlation name, 
obtained by calling the correlation names generator. 



Proof that theorem 5^2 holds for 4> = For[a c ,v qty ,(j)'], if it holds for 



I assume that a c G CPARTS, v qty G {1, 2, 3, . . . }, and </>' G YNFORMS. By the syntax 
of TOP, this implies that For[a c ,Vqty,(j)') G YNFORMS. I also assume that st G PTS, X 
is a Tsql2 value expression, g db G G db , eval(st, A, g dh ) G D* P , r (p n = (n, r 2 , r 3 , . . . , r n ), 
hcparts( a c) = (7>E C ) (as in the translation rule), and E = trans{For\a c ,v q t y ,4>'\,X). 
From the definition of r . . . n , it should be easy to see that r For[a c , v q ty, ^ 1 ~~ ' '' 



(t\, . . . , r n ). Finally, I assume that theorem 5.2 holds for <fi = cf>'. I need to show that: 



1. FCN(E) C FCN(X) 

2. eval{st,Z,g db ) G VREL P ( 

3. («!,..., v n ; vt) G eval(st, E, g dh ) iff for some g G G: 
\\ Tl \\M(st), 9 = /c(ui)j ? \\ Tn \\ M ^ = f D ( Vn ), and 

|| .For [cr c , l/giy , 0'] 1 1 ^ (^t),/ D (eW(^,A, a db )), 5 _ T 

Let E' be the embedded SELECT statement in the FROM clause of E, i.e. E' = trans{4>' , A). 
From the hypothesis, 0' G YNFORMS, st G PTS, r n = (n, . . . , r n ), A is a value 



expression, g G G , and eval(st, X, g ) G -Dp. Then, from theorem 5.2 for 



(according to the hypothesis, theorem 5.2 holds for <p = <j)') we get: 
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1'. FCTV(S') C FCN(X) 

2'. eval{st,Z',g db ) G VREL P (n) 

3'. (vi,..., v n ; ft) G eval(st, X', g"*) iff for some g G G: 
\\ Tl \\M(st), 9 = ? ||r n || M ^ = / D (« n ), and 

||0/||M(st),st,/r,(u t ),/ i3 (e«aZ(st,A,g di ')),g = j. 



Proof of clause 1 

The two VALID (a) in the VALID and WHERE clauses of X are not free column references 
in X, because X is a binding context for both of them. The a.l, . . . , a.n in the SELECT 
clause of X are not free column references in X for the same reason. X contains no 
other column references (and hence no other free column references) , apart from those 
that possibly appear within X'. By lemma A.l , this implies ([A.180| ). ( |A.180| ) and 
clause 1' imply clause 1. 

(A.180) FCN(E) C FCNQt?) 



Proof of clause 2 

When computing eval(st, X, g db ), the a of X ranges over the tuples of eval(st, X', g db ). 
By clause 2', eval(st,T,' , g db ) G VREL P (n). Hence, a ranges over tuples of the form 
(vi, . . . ,v n ;v t ), where /dM G PERIODS. The VALID (a) in the WHERE clause of X 
stands for fn(vt)- The arrangements of sections |5.8| and imply that 7 is the name 
of the Tsql2 granularity that corresponds to the complete partitioning f C parts{st)(a c ). 
Then, from the semantics of the INTERVAL function (section 5.3.4] ) , it should be easy to 



see that the constraint INTERVAL (VALID (a) , 7) = INTERVAL 'v qty ' 7 in the WHERE clause 
of X is satisfied iff foivt) covers exactly u qty consecutive periods from f cpa rts(st)(a c ). 



It should now be easy to see from the translation rule that ( A.181 ) holds. 

(A.181) eval(st,J:,g db ) = {(v 1 , . . . , v n ; v t ) G eval{st,Y,\g db ) \ 
for some pi,p 2 , ■ ■ ■ ,Pu qty G f cp arts(st)(a c ) : 
minpt(pi) = minpt{f D{vt)) , next(maxpt(pi)) = minpt(p2), 
next{maxpt(p2)) = minpt{p^), . . . , next(maxpt(p Uqty -i)) = minpt(p Uqty ) , 
and maxpt(pv qty ) = maxpt(f D (v t ))} 



Clause 2' and ( A.181 ) imply that for every (vi, . . . ,v n ;vt) G eval(st, X, g db ), vt G Dp. 
(A.181) also implies that eval(st, X, g db ) is a valid-time relation of n explicit attributes. 



Hence, eval{st,J:, g db ) G VREL P (n). Clause 2 has been proven. 
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Proof of clause 3 



Using the definition of ||For[<7 c , u, 



qty, 



Ll]\\st,et,lt,g 



(section |3.14 ), clause 3 becomes: 



(«!,.. .,v n ;vt) G eval(st,Y,,g ) iff for some g and pi,P2, ■ ■ ■ ,Pv qty ■ 
(A.182) g G G 

(A.183) ||ti|| m ( s *)-s = f D (v!), \\Tn\\ Mi8t) > 9 = f D (v n ) 

(A 184) \\(f)'\\ M ( st ^ st 'fD( v t)jD(eval(st,\,g db )),g _ j, 
(A. 185) pi,... ,p Vqty G fcparts(st)((T c ) 

(A. 186) minpt(pi) = minpt(j 'd(^)) , next(maxpt(pi)) = minptfa), 

next(maxpt(p2)) = minpt(p^), . . . , next(maxpt(p Uqty -i)) = minpt(p v t ), 
and maxpt(p Uqty ) = maxpt(j 'dM) 



I first prove that the forward direction of clause 3 holds. I assume that (v\, . . . , v n ; vt) G 
eval(st, Yi,g db ). I need to prove that for some g, p%, . . . , p V atv , (|A.182| ) - ( A,186| ) hold. 
The assumption that (v\, . . . , v n ; vt) G eval(st, S, g db ), and ( A.181| ) imply that for some 

(A.187) (v 1 ,...,v n ;v t ) G eval{st^\g db ) 
(A. 188) pi, . . . ,p Vqty G fcparts(st)(a c ) 

(A. 189) minpt(pi) = minpt{f oivt)) , next(maxpt(pi)) = minptfa), 

next{maxpt(p2)) = minpt{p^)^ . . . , next(maxpt(p Uqty ^i)) = rninpt{j> Vqty ) , 
and maxpt(p Uqty ) = maxpt(f oivt)) 



(A.187) and clause 3' imply that for some g: 
(A.190) g G G 

(A.191) \\n\\ M{8t) ' 9 = SD{ox%...,\\r n 

(A. 192) ^ ( p'^M( st )< st ,fD(vt)J D (eval(st,\,g db )),g 



M(st), 9 = fD{Vn) 

T 



(Q82p , (|A~l83[ ), ( |A~T8D, (|Al85| ), and ( |Al86D , are the same as (|Al90D , (|Q9l] ), 
( |A192|) , (|A.188j) and (|A.189| ) respectively, which are known to be true. The forward 
direction of clause 3 has been proven 



I now prove the backwards direction of clause 3. I assume that ( |A.182| ) - (|A.186| ) 
hold. I need to prove that (v\, . . . , v n ; vt) G eval(st,T>, g db ). According to ( |A.18l|) , it 
is enough to prove that (|Al87D - (|Al89D hold. ( |Al88| ) and ( |A~T89|) ar e the same as 
( A.185 ) and ( A.186 ) respectively, which were assumed to hold. (|A.187 ) follows from 
clause 3' and dAlSp , flP&jD , and (|A.184Q . The backwards direct ion of clause 3 has 
been proven. 



A. 3.10 Begin[(j)'} 
Translation rule 



If <j)' G YNFORMS and A is a TSQL2 value expression, then: 
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trans(Begin[<fi'], A) = 

(SELECT DISTINCT a.l, a. 2, ... , a.n 
VALID PERIOD (BEGIN (VALID (a) ) , BEGIN (VALID (a) ) ) 
FROM trans((j>', A imf ) (NOSUBPERIOD) AS a 
WHERE A CONTAINS BEGIN (VALID (a) ) ) 



n is the length of r (fi'~ l , and Xi n u is as in section 5.1C . Each time the translation rule is 
used, a is a new correlation name, obtained by calling the correlation names generator 
after A has been supplied. 



Proof that theorem 5^2 holds for <j) = Begin[<f>'\, if it holds for 



I assume that </>' G YNFORMS . By the syntax of Top, this implies that Begin[(f>'] G 
YNFORMS . I also assume that st G PTS, A is a Tsql2 value expression, Xi n u is 
as in section gltj , g db G G db , eval(st, A, g db ) G D* p , r /n = (n, r 2j r 3 , . . . , r n ), and 
E = trans (Begin[<j)'], A). From the definition of r . .. n , it should be easy to see that 
r Begin[<p']~ l = r cj)'~ l = (n, . . . , r n ). Finally, I assume that theorem |5.2| holds for <fi = <f>'. 
I need to show that: 



1. FCN(E) C FCN(X) 

2. eval{st,Z,g db ) G VREL P (n) 

3. (wi,..., v n ; ut) G eval(st, E, <7 d6 ) iff for some g £ G: 
\\ Tl \\M(st), 9 = ; || Tn ||^(^).f = and 

||5e5in[(/> / ]|| M ( s *)' s *'- / ' I3 ( l '')' /D ( em ^ st ' A ' 9db ^' 9 = T 

According to the syntax of Tsql2, Xi n u is a value expression. By lemma [57T| , (A. 193) 
- (A. 195) hold. 

(A.193) f D (eval(st,X imt ,g db )) = PTS 

(A.194) eval(st,X mit ,g db )eD P 
(A. 195) FCN(\ imt ) = 

Let E' be the SELECT statement in the FROM clause of E, i.e. E' = trans((f)', Xmit)- From 
the hypothesis, 0' G YNFORMS, st G PTS, r /n = (<q, . . . ,r n ), and <? db G G dfc . From 
the discussion above, Xi n u is a Tsql2 value expression, and eval(st, Xmit, g db ) G Dp. 
Then, from theorem |5.2| for (j> = <fr' (according to the hypothesis, theorem |5.2| holds for 
4> = 4>'), and using (A.193) and (A. 195), we get: 

1'. FCN{T!) C 

2'. eval(st, S', g db ) G VRELp(n) 

3'. (vi,..., v n ; t>£) G eval(st, E', <? rf6 ) iff for some g £ G: 
\\ Tl \\M(st), 9 = /c(ui)j ? || Tn || M ^ = / D (u n ), and 
||0/||M(si), S t,.fo(^),PTS, 9 = T 
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Proof of clause 1 



The three VALID (a) in the VALID and WHERE clauses of E are not free column references 
in E, because E is a binding context for all of them. The a.l, . . . , a.n in the SELECT 
clause of E are not free column references in E for the same reason. E contains no 
other column references (and hence no other free column references) , apart from those 
that possibly appear within E' or the A in the WHERE clause of E. By lemma AA, this 
implies ( A.196| ). Clause 1' and ( [A. 196 ) imply clause 1. 



(A.196) 



FCN(E) C FCW(E') U FCN(X) 



Proof of clause 2 

When computing eval(st,T,, g db ), the a of E ranges over the tuples of the relation 
nosubperiod(eval(st,T l ' , g db )). By clause 2', ev al(st, E', g db ) G VRELp(n). From the 
definition of nosubperiod (section |5.3.2 ), it follows that nosubperiod(eval(st, E', g db )) is 



also a valid-time relation of n explicit attributes, and that its time-stamps are also time- 
stamps of eval{st,T,' ,g db ), i.e. elements of Dp. Hence, nosubperiod(eval(st,T,' , g )) G 
VREL P {n). 

a is generated by calling the correlation names generator after A has been supplied. 
Hence, a cannot appear in A. The fact that a does not appear in A means that for 
every tuple (vi, . . . , v n ; v' t ): 

(A.197) eval(st, A, {g db ) a {vi _ Vn , K) ) = eval(st, A, g db ) 



The reader should now be able to see from the translation rule that ( |A.198 ) holds. 
Intuitively, (v i, . . . , v n ; v' t ) is the tuple of nosubperiod(eval(st,T>' , g db )) to which a 
refers. I should have used eval(st, A, {g db )" Vi Vn - V >\) instead of eval(st, A, g ), to cap- 
ture the fact that if there are any free column references of a in A, these have to be 
taken to refer to the (v\, . . . , v n ; v' t ) tuple to which a refers. By ( |A.197| ), however, 
eval(st, A, (g db )" Vi Vn - V ')) i s the same as eval(st, A, g db ). 

(A. 198) eval(st,Y<, g db ) = {{vi, . . . ,v n ;v t ) \ for some v' t , 

{v\, . . . , v n ;v' t ) £ nosubperiod(eval(st, E', g db )), 
fD(v t ) = {minpt(f D (v' t ))}, and 
minpt{f D {v' t )) e f D (eval(st, A, g db ))} 



Using the definition of nosubperiod (section |5.3.2j ), ( |A.198| ) becomes ( |A.199 ). 

(A. 199) eval(st, E, g db ) = {(vi, ... ,v n ;vt) \ for some v' t , 

{vi,...,v n ;v' t ) 6 eval(st,T,',g db ), 

there is no {v\, . . . , v n ; v' t '} £ eval(st, E', g db ) 

such that foiv't) C foiv"), 
fDiy t ) = {minpt(f D (v' t ))}, and 
minpt{f D {v' t )) G f D {eval(st,X,g db ))} 
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(A. 199) implies that every time-stamp vt of eval(st,Y,, g db ) represents an (instanta- 
neous) period that contains only the earliest chronon of a time-stamp of eval(st, g db ) . 
This implies that vt G Dp. (A. 199) also implies that eval(st,T,, g ) is a valid-time 
relation of n explicit attributes. Hence, eval(st,Jl, g db ) G VRELp(n), and clause 2 has 
been proven. 



Proof of clause 3 



Using the definition of \\Begin[(()']\\ M ^' st ' et,lt ' 9 (section |3.12|) , clause 3 becomes:Q 

(«!,..., v n ; v t ) G eval(st, S, g db ) iff for some <? and et' : 
(A.200) 5 G G 

(A.201) ||n|| M ^^ = . . . , \\r n \\ M ^ = f D { Vn ) 

(A.202) f D (v t ) C f D (eval(st, A, 

(A.203) / D (v t ) = {mmpt(et')} 

(A.204) et' G mx/pers({e G PERIODS \ \\(j)\\M{st),st,e,PTS,9 = T ^ 



I first prove that the forward direction of clause 3 holds. I assume that {v i, . . . , v n ; vt) G 
eval(st,Y<,g db ). I need to prove that for some g and et' , (1A.200[) - (|A.204|) hold. The 



assumption that (v\, . . . , v n ;vt) G eval(st, S, g db ), and ( A.199| ) imply that for some 

(A.205) ( Vl ,...,v n ;v' t ) £eval(st,Z',g db ) 

(A.206) there is no (v 1} .. .,v n ;v") G eval(st,T,' , g db ) 

such that /d(u£) C /dK') 
(A.207) /o(^) = {minpt(f D (v' t ))} 

(A.208) minpt{f D (v' t )) G f D (eval(st, A, 

([A.205|) and clause 3' imply that for some g: 



(A.209) 
(A.210) 
(A.211) 



<?GG 

Wn\\ M ™' g = fD(vi),.. 

u,iuM{st),stJ D {v' t ),PTS,g 



\\ T n 

T 



\M(st),g 



fD(v n ) 



I set et' as in (|A.212|) and prove that (|A.200[) - (|A.204|) hold. (|A.200|) and (|A.201| ) 
are the same as (|A.209| ) and (A.210), which are known to hold. ( A.203j ) follows from 
(|A.207|) and (|A.212|) . 



(A.212) 



et' = foiv't) 



I now prove ( |A.202|) . According to (|A.205|) , v' t is a time-stamp of eval(st, and 
by clause 2', eval(st,T,' , g db ) G VRELp(n). This implies that -Uj G -Dp, which in turn 

1 The reader is reminded that the mxlpers symbol is overloaded. When I £ TELEM S, mx lpers(l) is 
the set of the maximal periods of a temporal element, and it is defined as in section 5.2.2. When S 
is a set of periods, mxlpers(S) is the set of the maximal periods of a set of periods, and it is defined 
as in section B.3. 
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implies (A.213). By ( |A.207| ), fo(vt) is the instantaneous period that contains only the 
first chronon of /d( u 0- Therefore, (A.214) also holds. 



(A.213) 
(A.214) 



f D (v' t ) G PERIODS 
f D {v t ) G PERIODS 



From the hypothesis, eval(st, A, g db ) G Dp, which implies that fz)(eval(st, A, g dh )) is 
a period or the empty set. ( |A.208|) im plies that fo(eval(st, A, g db )) is not the empty 
set. Therefore, (A.215) holds' (|A.208|) implies (A.216). (A.216), flOOTb , (A.214), and 
(A.215) imply ( |A..202Q . 



(A.215) 
(A.216) 



f D (eval(st, X, g db )) G PERIODS 
{minpt{f D {v' t ))} C f D (eval(8t,X,^ b )) 



It remains to prove (|A.204Q . (|A.212|) , (A.213), and (|A.211| ) imply that: 
(A.217) et' G {e G PERIODS \ y\\M{st),st,e,PTS,g = T | 



To prove ( A.204| ), I need to prove that there is no et" that satisfies both (A. 218) and 
(A.219). 



(A.218) 
(A.219) 



et" G {e G PERIODS \ \\0\\M(«t),*t,e,PTS,g = T | 
et' C et" 



Let us assume that for some et", (A.218) and (A.219) hold. (A.218) implies that: 

(A.220) y\\M{st),st,et",PTS,g = T 



(A.221) 



et" G PERIODS 



I set v'l = f^iet"), which implie s (A.22 2). (|A7209|) , flPIcD , (A.222), (A.220), and 
clause 3' imply (A.223). (A.219), flOl2D , and (A.222) imply (A.224). 



(A.222) 
(A.223) 
(A.224) 



et" = f D {v'{) 
(vi, . . . , v n ;v") G eval(st,T l ',g db ) 
f D (v' t ) c f D (v'l) 



(A.223) and (A.224) are against ( A.206] ). Therefore, there can be no et" that satisfies 
(A.218) and (A.219). (|A.204j ) and the forward direction of clause 3 have been proven. 



I now prove the backwards direction of clause 3. I assume that for some g and et' , 
( [A.200D - QA.204D hold. I need to prove that {v\, . . . , v n ; v t ) G eval(st, S, g db ). Accord- 
ing to ( IA.198D , it is enough to prove that for some v' t , (|A.205|) - ( |A.208|) hold. 



dA~204|) implies (A.225). I set v' t = f^je t'), wh ich implies (A.226). flA~200D, (|O0lD , 
(A.226), (A.225), and clause 3' imply ( |A.205|) . (|A.207|) follows from ^2p| )~and 
(A.226). 



(A.225) 
(A.226) 



M(st),st,et',PTS,g 

et' = f D {v' t ) 



T 
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I now prove ( |A.208Q . p0| and ( pO^ ) imply ( |A~227j ). ( [A.227p and (A.226) imply 
flA308p . 

(A.227) minpt(et') G f D (eval(st, A, / 6 )) 

It remains to prove ( |A.206| ). Let us assume that there is a tuple (v±, . . . ,v n ; v' t '), such 
that (A.228) - (A.229) hold. 

A.228) (v!,. . . ,v n ;v") G eval(st,Y! , g db ) 

A.229) f D (v' t ) C f D {v'{) 

A.228) and clause 3' imply that for some g': 

A.230) g' £G 

A.231) Hnir^' = f D (yx), . . . , \\r n \\ M ^' = f D (v n ) 

A. 232) u^^M(st),st,f D (v' t '),PTS,g' = j, 

Lemma [0|, ( |A.200| ), (|A.230|) , the assumptions that 0' G YNFORMS and r c/> n = 
ri, . . . , r n ), ( |A.201| ), and ( A.231 ) imply that for every variable (3 in 0', g(/3) = g'(/3). 
Then, (A.233) holds. (|A.232|) and (A.233) imply (A. 234). 

A.233) U'\\M(st),st,fD{v' t '),PTS,g' _ ^^M(st),stjD(v' t '),PTS,g 

A.234) U'\\M(st),stJn{v' t '),PTS,g _ T 

set et" as in (A.235). (A.229), (A.226), and (A.235) imply (A.236). (A.234) and 
A.235) imply (A.237). 

// 



A.235) 
A.236) 
A.237) 



et" = f D {v'{) 
et' C et" 

h tnM(st),st,et",PTS,g 



T 



A.236), and (A.237) are ag ainst flPO^D , b ecause ( |A.204j ) and the definition of mxlpers 
see section |3.3|) imply that there is no et" G PERIODS , such that et' C et" and 
||^/||M( s t), s t,et ,PT5,g = r Therefo re, there can be no tuple (n, . . . ,v n ;v'() such that 
(A.228) and (A.229) hold. ( A. 206 ) and the backwards direction of clause 3 have been 
proven. 



A.3.11 End[(f)'} 
Translation rule 

If §' G YNFORMS and A is a Tsql2 value expression, then: 
trans(End[4>'], X) d = 

(SELECT DISTINCT a.l, a. 2, ... , a.n 
VALID PERIOD (END (VALID (a) ) , END (VALID (a) ) ) 
FROM transit, A imf ) (N0SUBPERI0D) AS a 
WHERE A CONTAINS END (VALID (a) ) ) 



n is the length of r ^ n , and Xi n u is as in section 5.1C . Each time the translation rule is 
used, a is a new correlation name, obtained by calling the correlation names generator 
after A has been supplied. 
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Proof that theorem 5.2 holds for <fi = End[<fi'], if it holds for 



The proof is very similar to that of section A.3.10 



A. 3.12 M[K,<j/\ 
Translation rule 

If k G CONS, 4>' G YNFORMS, and A is a Tsql2 value expression, then: 

trans(At[K, </>'], A) = trans((f>' , A') 

where A' is the expression INTERSECT (A, h' cons (K)). 



Proof that theorem 5^2 holds for 4> = At[K, <j)'\, if it holds for 



I assume that k G CONS and (j)' G YNFORMS. By the syntax of Top, this implies that 
At[n, (j)'] G YNFORMS. I also assume that st G PTS, A is a Tsql2 value expression, A' 
is as in the translation rule, g db G G db , eval(st, A, g dh ) G D* p , r <p'~ l = (n, t%, T3, . . . , r„), 
and X = trans(At[K, (/>'], A). From the definition of r . . . n , it should be easy to see that 
r At[n, (ft']" 1 = r (J)' n = (n, . . . j r n ). Finally, I assume that theorem 5.2 holds for eft = eft'. 
I need to show that: 



1. FCN(E) C FCN(X) 

2. eval(st,J:,g db ) G VREL P (n) 

3. («!,..., v ra ; vt) G eval(st, S, iff for some 5 G G: 
\\ Tl \\M(st), 9 = ? ||T n || M ( s *)>3 = / fl (« n ), and 
IIAfk, / ]l| M ( st )' s *>/£'(«t)./x>(e«aZ(st,A,g di ')),g _ y 



The restrictions of section |5.5| guarantee (A. 238). By the definitions of f CO ns and h c 



of sections |J and |5j, (A.239) holds and (A.238) is equivalent to (A.240). (I do not 
include the assignment to the correlation names among the arguments of eval, because 
by section |J, FCN(h' cons {K)) = 0.) 

(A.238) fcons(st)(K) G PERIODS 

(A.239) fcons(st)(K) = f D (eval(st, h' cons (n))) 

(A.240) f D {eval(st,h' cons (K))) G PERIODS 



By the syntax of Tsql2, since A is a value expression, A' is also a value expression. 
From the definition of A' it should be obvious that any free column reference in A' is 
situated within the A of A'. By lemma A.l , this implies ( A. 241 ). 



(A.241) 



FCN(X') C FCN(X) 
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The assumption that eval(st, A, g db ) G Dp, (A. 240), and the definition of A' imply 
(iAT242|) and that eval{st, X',g db ) G 



(A.242) f D (eval(st,X',g db )) = f D (eval(st, X,g db )) D f D (eval(st,h f cons (rz))) 



db\ 



The hypothesis that £ = trans(At[K, <f>'], A) and the translation rule of this section 
imply that E = trans(<p 1 A'). From the hypothesis, 0' G YNFORMS, st G PTS, 
r </> n = (ri, . . . ,T n ), and g d6 G G db . According to the discussion above, A' is a value 
expression, and eval(st, X' , g db ) G Dp. Then, from theorem 5.2 for <j> = <fi' (according 
to the hypothesis, theorem holds for <fi = <fi') we get: 



1'. FCN{H) C FCN(X') 

2'. eval{st,Z,g db ) £ VREL P (n) 

3'. (ui,..., w ra ; ut) G eval(st, S, <7 d6 ) iff for some g £ G: 



\M(st),g 



f D (vi), ... , \\t„ 



\M(st),g 



fnivn), and 



y||M(st),st,/ r) (%),/D(emKst,A',9' 1,, )),g = y 



Proofs of clauses 1 and 2 



Clause 1 follows from clause 1' and ( A.24l| ). Clause 2 is the same as clause 2', which 
is known to hold. 



Proof of clause 3 

I prove clause 3 by proving (A. 243). If (A. 243) holds, then clause 3' is equivalent to 
clause 3, and since clause 3' holds, clause 3 holds too. 



(A.243) 



u ( j ) /uM(st),stJ D (v t )J D (eval(st,\',g db )),g _ y jg 
\Atl K ^tV\M(si),st,f (vt),f D (.eval{st,\,g db )),g _ p 



From the hypothesis, k G CONS. By the semantics of Top, this implies that: 

(A.244) M\ M(St) ' 9 = fcons(st)(K) 



I first prove the forward direction of (A.243). I assume that ( A. 245 ) holds. 

(A. 245) \\(frl\\M(st),st,fn(vt),fD(eva,l(st,\',g db )),g _ p 

Using fl£24p and (A.239), (|AT245|) becomes ( WM§ . 

(A. 246) \\ ( j ) >^I(st),stJ D (vt)jD(eval(st,\,g db ))nfcons(st)(K),g _ rp 



Using flAjg ), (|A.246Q and (A.238) become (A.247) and (A.248) respectively. (A.247), 
(A. 248), and the definition of At (section (3.10|) imply (A. 249). The forward direction 
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of (A. 243) has been proven. 

(A. 247) ||^/||M( s «), s t,/ D («t),/i3(e»aKs*^.9' ii '))n||«|| M(st) ' 9 ,g = T 

(A.248) \\K\\ M{st) ' 9 G PERIODS 

(A. 249) || j4£[k, (f)'~\\\ M( - st )' st 'fD(vt)jD(eval(st,\,g db )),g _ j, 

I now prove the backwards direction of (A. 243). I assume that (A. 249) holds. By 
the definition of At, this implies that (A. 247) holds. Using ( |A.244j) , (A. 247) becomes 
(|A.246|) . (A.239), jOi^ ), and ([A.246Q imply ( |A.245|) . The backwards direction of 
(A. 243) has been proven. 

A.3.13 Befare[K,<j/\ 
Translation rule 

If k G CONS, (j)' G YNFORMS, and A is a Tsql2 value expression, then: 
trans(Before[K, (j)'], X) = trans{dp ,\ ) 
where A' is the expression: 

INTERSECT (A, PERIOD (TIMESTAMP 'beginning', BEGIN (.h' cons (n)) - INTERVAL '1' x)) 
and x is the Tsql2 name of the granularity of chronons. 



Proof that theorem 5^2 holds for <fi = Before[n,4>'], if it holds for 



The proof is very similar to that of section A. 3. 12 



A. 3.14 After[K,<f/\ 
Translation rule 

If k G CONS, dp' G YNFORMS, and A is a Tsql2 value expression, then: 

def 

trans(After[K,(f)'], A) = trans ((/)' , A') 
where A' is the expression: 

INTERSECT (A, PERIOD (END (.h' cons (K) ) + INTERVAL '1' \> TIMESTAMP 'forever')) 
and x is the TSQL2 name of the granularity of chronons. 



Proof that theorem 5.2 holds for <fi = A fter[n, <£>'], if it holds for </> = </>' 
The proof is very similar to that of section A.3.12 . 
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A. 3.15 At[a g ,p,(j)'} 
Translation rule 

If cr g G GPARTS, (3 G VARS, 6' G YNFORMS, and A is a Tsql2 value expression, 
then: 

def 

trans{At[a g , (3,6],X) = 

(SELECT DISTINCT ai . 1 , a 2 . 1 , a%.1, ... , a 2 .n 
VALID VALID (a 2 ) 

FROM (h' gparts (a g )) AS a\, trans(<f>' , A') AS a 2 ) 

n is the length of r 6'~ l , and A' is the expression INTERSECT , A)0 Each time the 
translation rule is used, a\ and «2 are two new different correlation names, obtained 
by calling the correlation names generator after A has been supplied. 



Proof that theorem 5.2 holds for 6 = At[a g , (3, 6'], if it holds for 



I assume that a g G GPARTS, (3 G VARS, and 6' G YNFORMS. By the syntax of 
Top, this implies that At[a g , j3, 6'} G YNFORMS. I also assume that si G PTS, A is a 
Tsql2 value expression, A' is as in the translation rule, g G G db , eval(st, A, g dh ) G -Dp, 
r 6 n = (n, T2, T3, . . . , r n ), and £ = trans(A£[<7 9 , /3, (/>'], A). Prom the definition of r . . . n , 
it should be easy to see that r At[a g , (3, cp']" 1 = ((3, t\, . . . , r n ). Finally, I assume that 



theorem 5.2 holds for 6 = 6.1 need to show that: 



1. FCN(E) C FCN(X) 

2. eval(st,Z,g db ) G VRELp(n + 1) 

3. (u, ui, . . . , f n ; t>t) G eval(st, E, iff for some g £ G: 

m M(st), 9 = /d(u)j || ri ||M( si))5 = /d(ui)) ? ||r n || M ( st )>s = f D (v n ), and 

||^4i[(Tg,/3, / ]|| M ( si )' st '/l?(' ; i)jD(em«(st,A,(; db )),g _ rp 

From the definition of A' it should be obvious that any free column reference in A' is 
either the a\.l of A', or is situated within the A of A'. By lemma |A.1| , this implies 
(A. 250). The only correlation name that has a free column reference in oc\.l is oc\. 
Hence, FCN{a 1 .l) = {ax}, and (A.250) becomes (A.251). 

(A.250) FCN(X') C FCN( ai .l) U FCN(X) 

(A.251) FCN(X') C {ai} U FCN(X) 



The ai of S is generated by calling the correlation names generator after A has been 
supplied. Hence, a\ cannot appear in A. The fact that a\ does not appear in A means 



2 A' will end up being a part of the embedded SELECT statement trans(<f>' , A'). This means that 
ati, which is part of A', will appear within trans{4>' ,A'), i.e. in the s ame F ROM clause that defines 
at\. This is allowed in the TSQL2 version of this thesis; see section 5.3.5. Similar comments ap- 
ply to the translation rules for Before[a g ,/3,cj>'], After[a g , /3, <f>'], At[a c , /3, <f>'], Before[a c , P, </>'], and 
After[a c .,,8,o'\. 
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that for every possible tuple (v) of a one-attribute snapshot relation (i.e. for every 
v E D): 

(A.252) eval(st, A, {g db )J\ ) = eval(st, A, g db ) 



The assumption that eval(st, A, g db ) E -Dp, and the fact that for every d£D, ( A.252; ) 
holds imply that (A. 253) and (A. 254) hold for every v E D. 

(A.253) eval(st,X,(g db )^)eD P 
(A.254) f D (eval(st, A, (s*)^)) E PERIODS* 



By the syntax of Top, since A is a value expression, A' is also a value expression. If 
v E Dp (Dp C D), by (A.254) (which holds for v £ D) and the definition of A', (|A.255| ) 
and jAl56| ) hold. 

(A.255) f D (eval(st, A', = D f D (eval(st, A, (<? d6 )g)) 

(A.256) eval(st, A', (g db )?A) E Dp 



Let E' be the embedded SELECT statement in the FROM clause of £ to which 02 refers, 
i.e. £' = transit, A'). From the hypothesis, 4>' E YNFORMS, st E PTS, c/ db E G, and 
r (p n = (tx, . . . ,t„). From the discussion above, A' is a value expression. If v E -Dp, 
then (g db )^ } E G<* and (by ( [A256D ) et;a/(si, A', (3*)^) E Dp. From theorem |]| for 
(f> = <f>' (according to the hypothesis, theorem holds for <f> = (j)') we get the following. 
(The condition v E Dp does not affect clause 1'.) 



1'. FCN(E') C FCN(X') 

2'. If u E -Dp, then euaZ(si,£ y \(g db )?A) E VREL P (n) 

3'. If v E -Dp, then: 
( 

(«!,..., v n ; u t ) E eval(st, (g db )^)) iff f° r some 5' G G: 
|| Tl ||M( s t) lS ' = /d(ui); ; ||r n || M ( s *)^' = f D (v n ), and 

^ r ^M(st),stJ D (v t ),f D (eval(st,X',(g db )^ ) )),g' = j, 
) 



Proof of clause 1 



The VALID(a 2 ) in the VALID clause of £ and the ai.l, 02-1, ... , ai-n in the SELECT 
clause of £ are not free column references in £, because S is a binding context for 
all of them. £ contains no other column references (and hence no other free column 
references), apart from those that possibly appear within the h' arts (a g ) or the £' in 
the FROM clause of £. By lemma A.l, this implies (A. 257). The definition of h' gparts (a g ) 



of section ^9| implies that FCN(h' gparts (a g )) = 0. Hence, (A. 257) becomes (A.258). 
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(A.258) and clause 1' imply (A.259). (A.259) and (A.251) imply (A.260). 

(A.257) FCN&) C FCN(h' gparts (a g )) U FCN(Z') 

(A.258) FCN(E) C FCW(E') 

(A.259) FCN(Ti) C FCN(X') 

(A.260) FCW(E) C { ai } U FCW(A) 

ai cannot have a free column reference in E. The proof follows. Let us assume that 
there is a free column reference £ of ot\ (i.e. of the form a±.k or VALID(ai)) in E. Then, 
by the definition of free column reference, (a) £ is part of E. From the translation rule, 
it is also the case that (b) a± is defined by the topmost FROM clause of E. 

If £ is in the topmost FROM clause of E, it is either in the h' gparts (a g ) or in the E' (the 
trans((f>' , A')). £, however, cannot be in the h gparts (a g ), because a\ is generated by 
the correlation names generator, and it is assumed that the correlation names of the 



generator do not appear in the SELECT statements returned by h gparts (see section 5.11 ). 



So, if C is in the topmost FROM clause of E, it is in the E', i.e. after the definition of 
ol\. Hence, (c) Q is either not in the topmost FROM clause of E, or it is in the topmost 
FROM clause of E, but it follows the definition of a\. (a), (b), and (c) imply that E is 
a binding context for This implies that £ is not a free column reference in S, which 
is against the hypothesis. Hence, a\ cannot have a free column reference in E, i.e. 
a x FCiV(E). This and (A.260) imply that FCiV(E) C FCN(X). Clause 1 has been 
proven. 



Proof of clause 2 

According to section |5l| FCN(h gparts (a g )) = 0, eval(st, h' gparts (a g )) G SREL(1), and 
for every (v) G eval(st, h' gparts (cr g )), v represents a period of a gappy partitioning (hence, 
v G Dp). By clause 2', it is also the case that if v G Dp, then eval(st,T, f , {g db )^) G 
VRELpin). It should now be easy to see from the translation rule that ( |A.261| ) holds. 

(A.261) eval{st,Z,g db ) = {(v, v u . . . , v n ; v t ) \ 

(v) G eval(st,h' gparts (a g )), and 
(v 1 ,...,v n ;v t ) eeval( S t,V,(g db )^ } )} 

(g db )"v) i s use d in the last line of (|A.261| ) instead of g db , to capture the fact that if there 
is any free column reference of ot\ in E', this has to be taken to refer to the (v) tuple 
of eval(st, h' gparts (a g )) to which a± refers. 



( A.261D and the discussion above imply that for every (v, v\, . . . , v n ;vt) G eval(st, E, g db ), 



(v\, . . . , v n ; vt) G eval(st,T,' , (g ' G VRELp(n). This implies that v t G Dp. 

(A.261) also implies that eval(st,Y,, g db ) is a valid-time relation of n + 1 explicit at- 



tributes. Hence, eval{st,Yi,g ) G VRELp(n + 1). Clause 2 has been proven. 
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Proof of clause 3 



Using the definition of ||Af[(T fl , /3, 
g(f3), clause 3 becomes: 



Li] \\st,et,lt,g 



(section and the fact that \\P\\ M ^'9 



(A.262) 
(A.263) 
(A.264) 
(A.265) 
(A.266) 



(v, v\, . . . , v n ; vt) G eval(st, E, g db ) iff for some g : 

g^G 

g(P) = f D (v) 

WnW*™* = fDW, - ||r„|| M ^ = / D K) 
g{(3) G f gP arts(st)(a g ) 

uauM(st),stJ D (v t ),f D (eval(st,\,9 db ))n g (l3),g _ j, 



I first prove the forward direction of clause 3. I assume that (v, v\, . . . , v n ; Vt) G 
eval(st,Y,, g db ). I need to prove that for some g, ( A.262[) - ( |A.266| ) hold. The as- 
sumption that (v,v\, . . . , v n ;vt) G eval(st, S, g db ), and ( A.261| ) imply that: 

(A. 267) (v) G eval(st, h' gparts (a g )) 

(A.268) {vi,...,v n ;v t ) £ eval(st,Z> , (g db )^) 

(|A.267[) and the discussion in the proof of clause 2 imply (|A,269| ) . 

(A.269) v G Dp 

(|A.269| ), ( |A.26g| ), and clause 3' imply that for some g : 

(A.270) g' eG 

(A.271) 

(A.272) 



rm— =fD(vi),...,\\r n \\ M(st, ' g ' =/dK) 



\M(st),g' 

b/ nM(st)MjD(vt)J D (eval(st^%(g< lb )"i } )),g' = j, 



\M(st),g' 

\l („db\ a l \\ „' 



Let g = (g')f D f v y Lemma |A/7|, (|A.269| ), the assumption that G VARS, ( |A.270i ), and 
the definition of g imply ( A.262| ). ( A,263|) follows from the definition of g. 

I now prove flP66| ). ( [A369| ), (|A.255| ) (which holds for v G Dp,) and (^26|) (already 
proven) impl y (|A.273| ). ( |A~269|), the fact that D P C D, ( |A~25^) (w hich holds for 
v G D), and ( |A.273| ) imply ( |A.274| ). (^27|) and (^27|) imply flA.27^ ). 



f D (eval(st, A', (g db )^)) = g{fi) D f D (eval(st, A, (g d %)) 
f D (eval(st,\',(g db )^ ) ))=g(f3)nf D (eval(st,\,g db )) 

lU lnM(st),stJ D (vt)J D (eval(st,\,g db ))ng(f3),g' _ T 



(A.273) 
(A.274) 
(A.275) 

The syntax of Top (section and the fact that At[a g , f3,<f>'] G YNFORMS, imply 
that (5 does not occur in (/)'. This and the definition of g imply ( A.276j ). ( A.275Q and 
([AT276D imply ( |AT266| ). 



(A.276) 



y^M(st),st,f D (v t )J D (eval(st,\,9 db ))ng(l3),g' = 
/ ,,M(st),st,fD(vt)J D (eval(st,\,g db ))ng(t3),(g'f fD{v) 



ilnM( S t),stjD(vt)J D (eval(st,\,g db ))ng(f3),g 
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(A. 264) follows from lemma A. 3, the assumption that r (p'~ l = (n, . . . , r n ), (A. 271), and 
the fact that g and g' assign the same values to all variables, possibly apart from /3, 
which does not occur in 6'. 



It remains to prove ( A.265 ). The definition of h gpar t s of section ^1] and ( |A.267 ) imply 



(A.277). (A.277) and the definition of f gparts of section Kj imply (A.278). (A.278) 



and ( A.263j ) (proven above) imply ( A.265| ). The forward direction of clause 3 has been 
proven. 



(A.277) 
(A.278) 



{v} G h gpar t s (St) (a g ) 
f D (v) G fgparts (St) {a g ) 



I now prove the backwards direction of clause 3. I assume that (A. 262) - (A. 266) hold. 
I need to prove that (v, v\, . . . , v n ; vt) G eval(st,Y,, g db ). According to ( A. 261 ), it is 
enough to prove that ( A.267| j and ( A. 268] ) hold. 



(|A.263|) and (|A.265|) imply (A.279). (A.279) and the definition of f gparts of section [p 



imply (A. 280). (A. 280) and the definition of h g p ar ts of section [T^ imply ( A.267| ) 



(A.279) 
(A.280) 



/d(«) G fgparts {St) (a g ) 
{v} G hgparts{st){a g ) 



It remains to prove ( |A.268j). (A. 279) implies that /d(«) G PERIODS, which in turn 
implies (A.281). (A.281), ( |A.255|) (which holds for v G Dp), (|Ai25^) (which holds for 
v € D), and ( |Ogg ) imply (A.282). ( gjgD and (A.282) imply (A.283). 



(A.281) 
(A.282) 

(A.283) 



v G Dp 

f D (eval(st,X',(g db )^ ) ))=g((3)nf D (eval(st,X,g db ) 

,,,n,M(st),stJ D (vt)J D (eval(st,X',(g db )^ ) )),g = rp 



Clause 3', (A.281), flOfflq) , (^26|), and (A.283) imply ( gjgg ). The backwards 
direction of clause 3 has been proven. 



A. 3.16 Before[(Tg,fJ,(i)'] 
Translation rule 

If cr g G GPARTS, (3 G VARS, <p' G YNFORMS, and A is a Tsql2 value expression, 
then: 

trans(Before[a g , /3, 0'], A) d = 

(SELECT DISTINCT a x .l, a 2 .l, a 2 .2, ... , a 2 .n 
VALID VALID (a 2 ) 

FROM (/ig parts (cr g )) AS ati , trans((j>' , X') AS a 2 ) 

n is the length of r n , A' is the expression INTERSECT (PERIOD (TIMESTAMP 'beginning' , 
BEGIN(ai.l) - INTERVAL '1' x) » X) , and % is the TSQL2 name of the granularity of 
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chronons. Each time the translation rule is used, a\ and ai are two new different 
correlation names, obtained by calling the correlation names generator after A has 
been supplied. 



Proof that theorem 5^2 holds for (j) = Before[a g , P, ft], if it holds for 



The proof is very similar to that of section A. 3. 15 



A.3.17 After[a g ,/3,ft] 
Translation rule 

If a g G GPARTS, p G VARS, ft G YNFORMS, and A is a Tsql2 value expression, 
then: 

trans(After[a g , P, ft], A) = 

(SELECT DISTINCT a-i.l, a 2 ■ 1 , a 2 .2, ... , a 2 .n 
VALID VALID (a 2 ) 

FROM (h' gparts (a g )) AS cti, trans(ft , A') AS a 2 ) 

n is the length of r ft~ l , A' is the expression INTERSECT(PERIOD(END(ai.l) + INTERVAL 
'1' x, TIMESTAMP 'forever'), A), and x is the TSQL2 name of the granularity of 
chronons. Each time the translation rule is used, a± and 02 are two new different 
correlation names, obtained by calling the correlation names generator after A as been 
supplied. 



Proof that theorem ^2 holds for (j) = After [a g , (3, ft], if it holds for 



The proof is very similar to that of section A. 3. 15 



A.3.18 At[a c ,0,ft] 
Translation rule 

If a c G CPARTS, p G VARS, ft G YNFORMS, and A is a Tsql2 value expression, 
then: 

de f 

trans{At[a c ,P,ft],X) = 

(SELECT DISTINCT ai.l, a 2 ■ 1 , a 2 .2, ... , a 2 .n 
VALID VALID (a 2 ) 

FROM (S c ) AS ai, trans(ft ,X') AS a 2 ) 

n is the length of r ft~ l , X' is the expression INTERSECT (ai. 1 , A), and S c is the second 
element of the pair (7, E c ) = h' cparts (a c ) (section |5.9[ ). Each time the translation rule 
is used, ct\ and «2 are two new different correlation names, obtained by calling the 
correlation names generator after A has been supplied. 
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Proof that theorem 5.2 holds for <fi = At[a c , (3,4>'], if it holds for 



The proof is very similar to that of section A.3.15 



A. 3.19 Before[a c ,fi,(j)'} 
Translation rule 

If u c G CPARTS, (3 G VARS, <f>' G YNFORMS, and A is a Tsql2 value expression, 
then: 

trans(Before[a c , (3, <fi'], A) d = 

(SELECT DISTINCT a x .l, a 2 .l, a 2 .2, ... , a 2 .n 
VALID VALID (a 2 ) 

FROM (E c ) AS ai, trans (ft , X') AS a 2 ) 

n is the length of r /_l , A' is the expression INTERSECT (PERIOD (TIMESTAMP 'beginning' , 
BEGIN(ai.l) - INTERVAL '1' x> > A) , x is the Tsql2 name of the granularity of chronons, 
and E c is the second element of the pair (7, E c ) = h' cparts (a c ). Each time the translation 
rule is used, a\ and «2 are two new different correlation names, obtained by calling 
the correlation names generator after A has been supplied. 



Proof that theorem 5.2 holds for (f) = Before[a c , f3, <//], if it holds for 



The proof is very similar to that of section A.3.15 



A.3.20 After[a c ,P,<f/\ 
Translation rule 

If cr c G CPARTS, /3 G VARS, <j)' G YNFORMS, and A is a Tsql2 value expression, 
then: 

trans(After[a c , (3, ft], A) d = 

(SELECT DISTINCT ai.l, a 2 . 1 , a 2 .2, ... , a 2 .n 
VALID VALID (a 2 ) 

FROM (E c ) AS ai, trans((j)' ,\') AS a 2 ) 

n is the length of r (p n , A' is the expression INTERSECT(PERIOD(END(ai.l) + INTERVAL '1' 
X, TIMESTAMP 'forever'), A) , x is the TSQL2 name of the granularity of chronons, and 
S c is the second element of the pair (7, S c ) = h' cparts (a c ). Each time the translation 
rule is used, a\ and ai are two new different correlation names, obtained by calling 
the correlation names generator after A has been supplied. 



Proof that theorem 5.2 holds for (j) = After[a c , ft, (/)'], if it holds for 



The proof is very similar to that of section A.3.15 
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A.3.21 M[<t>x,<h\ 
Translation rule 

If (f>i, fa G YNFORMS and A is a Tsql2 value expression, then: 

def 

trans(At[4>i,(j)2\, X) = 

(SELECT DISTINCT ai . 1 , ai.2, ... , a\.n x a 2 ■ 1 , a 2 .2, ... , a 2 .n 2 
VALID VALID (a 2 ) 

FROM trans((j>i, A lmi ) (NOSUBPERIOD) AS oty , 

trans((f) 2 ,X') AS a 2 ) 
WHERE . . . 
AND ... 

AND . . . ) 



Xi n it is as in section 5.10| , A' is the expression INTERSECT (VALID («i) ,A), and n\,ri2 are 



the lengths of r 4>i~ > and r 02 n respectively. Each time the translation rule is used, a.\ 
and «2 are two new different correlation names, obtained by calling the correlation 
names generator after A has been supplied. ot\ is generated after trans{<f>\, Xinit) has 
been computed. Assuming that r <pi n = {t\, r\, . . . , r„ x ) and r 4>2~ 1 = (t 2 , • • • , 7"n 2 ), the 
" . . . "s in the WHERE clause are all the strings in S: 

S = {"ai.i = a 2 .j" | i G {1,2,3, ... ,ni}, j G {1,2,3, . . . ,n 2 }, 
= r 2 , and r/,r 2 G VARS} 



Proof that theorem 5.2 holds for 



At[4>i,(j)2], if it holds for 



'>i and 



I assume that <j>i,(f>2 G YNFORMS. By the syntax of Top, this implies that Attyijfa] 6 
YNFORMS . I also assume that si G PTS, A is a Tsql2 value expression, A' and Xi n u 
are as in the translation rule, g db G G db , eval(st, X,g db ) G Dp, r 4>\~ 1 = (t^t^, . . . ), 



p, r </>r 

2 n = (r 2 , r| , • • • , t 2 2 ), and E = ircms(^4i[(/>i, fa], A). From the definition of r . . . n , it 



should be easy to see that: 



r At[^ lj ^ 2 ]-' = <n 1 ,r 2 1 



'n 2 / 



Finally, I assume that theorem 5.2 holds for 



and 



62 . I need to show that: 



1. FCN(E) C FCN(X) 

2. eval(st,T l ,g db ) G VREL P (m + n 2 ) 

3. . . . , f^, f 2 , . . . , v 2 2 ; Uj) G eval(st, E, </*) iff for some g £ G: 

\\rl\\ M ^ = f D (v{), ... , iKir(^ = / «), 

|| r 2||M Ml p = /d(w?)j _ || r 2 2 ||M(,t), 9 = /d(u 2 2)) and 

||yli[(/)i,(/)2]|| M( ' st) ' s *' /D(t ' t) ' /D(em/(s *' A ' 5,d ' ,)) ' 9 = T 
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From the definition of A' it should be obvious that any free column reference in A' is 



either the VALID (a\) of A', or is situated within the A of A'. By lemma AA, this implies 
that: 

(A.284) FCN(X') C FC7V(VALID(ai)) U FCN(X) 

The only correlation name that has a free column reference in VALID(ai) is a.\. Hence, 
FCW(VALID(ai)) = {01}, and ( 1P84D becomes ( |Q85|) . 



(A.285) FCN(X') C { ai } U FCN(X) 

The a\ of S is generated by calling the correlation names generator after A has been 
supplied. Hence, a.% cannot appear in A. The fact that ol\ does not appear in A implies 
that for every v\, v\, ■ ■ ■ , G D and v\ G Dt- 

(A.286) eval(st,X,(g db )^ lvl , , ) = eval(st, A, g db ) 



The fact that for every v\, v\, . . . , v„ G D and v\ G Dt, ( A.286 ) holds, and the 
assumption that eval(st, A, g**) £ .Dp, imply that (A. 287) holds for every v{, . . . , G 
D and ^ G D T . 



(A.287) /^(e^(^A,(^);\^ ^ ;utl) )) 



G PERIODS* 



By the syntax of Top, since A is a value expression, A' is also a value expression. 
(A.287) and the definition of A' imply that (A.288) and (A.289) hold for v\, . . . , G D 
and v\ G D P (D P C D T ). 

(A.288) f D (eval(st, A', C^)^^,^^))) = 



/dK 1 ) n f D (eval(st, A, (s^i^...^ .„!>)) 
(A.289) ml(^,A',(/)J } ^ .....^j)) G 



By the syntax of Tsql2, Aj n ji is a value expression, and by lemma |5.1| , (A. 290) - 
(A.292) hold. 

(A.290) f D (eval(st,X imt ,g db )) = PTS 

(A.291) eval(st, Xi n it,g db ) G Dp 

(A.292) FCN(X imt ) = 

Let Si be the first embedded SELECT statement in the FROM clause of S, i.e. Si = 
trans (</>i, A). From the hypothesis and the discussion above, <p\ G YNFORMS ', si G 
PT5, r ^i n = (r-j 1 , . . . , ), Xi n it is a Tsql2 value expression, eval(st, Xi n u, g db ) G Dp, 
and g 1 * G C*. From theorem |5.2| for cj) = (according to the hypothesis, theorem 5.2 
holds for 4> = 4>i), and using (A.290) and (A. 292), we get: 

l 1 . FCTV(Ei) = 

2 1 . em/(st,Si,/ b ) G VREL P { ni ) 
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3 1 . (v{, . . . , v^; v\) G eval(st, £1, <? ) iff f° r some g\ G G: 
Urir^i =f D (vl), ... , IKH^'W =/d«), and 
||0 1 ||Ar(<rt),«t,/D(« t 1 ),prs,fli _ T 



Let £2 be the second embedded SELECT statement in the FROM clause of £, i.e. £2 = 
trans ((^2, A'). From the hypothesis, fa G YNFORMS, st G PTS, G and 
r 02 n = (r 2 , . . . , t 2 2 ). From the discussion above, A' is a Tsql2 value expression. If 
G D and G D P , then (^^...^i) G G db and (by (A.289)) 

eval(st,X',(g db )^i vl vl ^) G Dp. From theorem for (f) = fa (according to 



the hypothesis, theorem holds for <f> = fa) we get the following. (The conditions 
v I, v 2 , ■ ■ ■ , v„i e ^ and u t e do not affect clause l 2 .) 



I 2 . FCW(£ 2 ) C FCN(X') 

2 2 . If Uj, U2, • • • , G I? and ^ G Dp, then: 

^(si.^.^jj^ ,...,«i 1 ^>) e VREL P {n 2 ) 

3 2 . If ^1,^2, • • • , u„j G D and G Dp, then: 
( 

(«!,..., ^ 2 ;wt) G eval(st,E 2 ,(g db ) ( ^i tV i ^i iff for some 52 G G: 
|| Ti2 ||M Wlff2 = /z?(v?)j ^ ||r 2 2 f^)^ =} D k 2 ), and 

||02 1| W-^.-,^^) =T 

) 



Proof of clause 1 



The VALID(a2) in the VALID clause of £ and the a\.l, . . . , ai.Bi, 0:2. 1, . . . , a 2 .n 2 in the 
SELECT clause of £ are not free column references in E, because £ is a binding context 
for all of them. Any column references of the form a\.i or a 2 .j in the WHERE clause 
of £ (i G {1, 2, 3, ... , m}, j G {1, 2, 3, ... , 77-2}; these column references derive from S) 
are not free column references in £ for the same reason. E contains no other column 
references (and hence no other free column references), apart from those that possibly 
appear within the £1 and £2 in the FROM clause of £. (The reader is reminded that 
£1 = trans(fa, Xinit) and £2 = trans(fa, X').) By lemma [A.l| , this implies (A. 293). 
(A.293), clause l 1 , and clause l 2 imply (A.294). (A.294) and flP85| ) imply (A.295). 

(A.293) FGiV(£) C FCW(Ei) U FCN(T, 2 ) 

(A.294) FCN(E) C FCN(X') 

(A.295) FCN(Z) C {ai} U FCN(X) 



a>i cannot have a free column reference in £. The proof follows. Let us assume that 
there is a free column reference £ of a\ in £ (£ has the form oi\.k or VALID(ai)). Then, 
by the definition of free column reference, (a) Q is part of £. From the translation rule, 
it is also the case that (b) a\ (the correlation name of £) is defined by the topmost 
FROM clause of E. 
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If C is in the topmost FROM clause of S, it can only be in the £1 (the trans{4>\, Xmit)) 
or the £2 (the trans (</>2, A')) of £. (, however, cannot be in Si, because a± (the 
correlation name of is generated after Si has been computed (see the translation 
rule), and the correlation names of the generator cannot be used before they have been 
generated (see section 5 . 1 1| ) . 



So, if C is in the topmost FROM clause of £, £ has to be in the £2 of £, i.e. after the 
definition of a\. Hence, (c) £ is either not in the topmost FROM clause of £, or it is in 
the topmost FROM clause of £, but it follows the definition of a\. (a), (b), and (c) imply 
that E is a binding context for which implies that Q is not a free column reference 
in S. This is against the hypothesis. Hence, a\ cannot have a free column reference 
in E, i.e. ai FCN(E). This and (A.295) imply that FCN(E) C FCN(X). Clause 1 
has been proven. 



Proof of clause 2 



When computing eval(st,Y,, g db ), the qi of E ranges over the tuples of the relation 
nosubperiod(eval(st,T,i, g db )). By clause 2 1 , eval(st, £1, g db ) G VRELp(ni). The defi- 
nition of nosubperiod (section |5.3.2 ) implies that nosubperiod(eval(st,'Ei, g db )) is also 



a valid-time relation of n\ explicit attributes, and that all its time-stamps are also time- 
stamps of eval(st, T>i,g db ), i.e. elements of Dp. That is, eval(st, T>i,g db ) contains tuples 
of the form (v{, . . . , v^vl), with v{, . . . , G D and v\ G Dp. By clause 2 2 , it is 
also the case that if v\, . . . , G D and v\ G Dp, then eval(st, £2, (g db )^i v i . v i^) G 

VRELp{ri2). It should now be easy to see from the translation rule that ( A.296j ) holds. 



(A.296) eval(st,J:,g db ) = 

{( v h ■ ■ ■ • • > v n 2 ; w *) I for some w t> 

; G nosubperiod(eval(st, Y<i,g db )), 
(vi,...,vl 2 ;v t ) £ eval{st,T t2 ,(g db )^i^ v ^. v i ) ), and 

if i G {1, 2, 3, ... , m}, j G {1, 2, 3, ... , n 2 }, r/, r? G VARS, 

-1 2 1 2i 



and Tj = , then t>j = Vj } 



(g )^i is used in the fourth line of ( A.297 ) instead of g , to capture the fact 

that if there is any free column reference of a\ in £2, this has to be taken to refer to 
the tuple of nosubperiod(eval(st,T,i, g )) to which a\ refers. 
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Using the definition of nosubperiod (section |5.3.2| ), ( |A,296| ) becomes ( |A.297 ). 

(A.297) eval(st,Z,g db ) = 

{( v h ■ ■ ■ , v ln v L ■ ■ ■ i v m'' v t) I for some v h 
{v\, . . . y ni ;v\) G eval(st,J: 1 ,g db ), 

there is no (v\, . . . , ; v ] ) G eval(st, Y<i,g db ) 

such that Jd{vI) □ foiyl), 
{v\,...,v 2 n2 ;v t ) G eTO^^a^/jJj ^^^j), and 

if i G {1, 2, 3, ... , m}, j G {1, 2, 3, ... , n 2 }, r/, r] G VAflS, 
and = Tj, then = Vj} 

( |A.297[ ) and the discussion above imply that for every (v\, . . . ,1^, i>f , . . . , v^ 2 ;vt) G 
em^E,/ 6 ), («?,..., t,^;^) G eval(st^ 2l (g db ) a { ^ G VREL P (n 2 ). This 

implies that ft G -Dp. ( |A.297| ) also implies that eval(st,Ti, g db ) is a valid-time relation 
of roi + n2 explicit attributes. Hence, eval{st,Y,,g db ) G VRELp{n\ + n 2 ). Clause 2 has 
been proven. 



Proof of clause 3 

I now prove clause 3. Using the definition of ||^4i[0i, (j) 2 ]\\ st ' et ' lt ' 9 (section |3.10| ), clause 
3 becomes: 

(v\, . . . , , v\ , . . . , Vn 2 ;vt) G eval(st, S, g db ) iff for some g and et' : 
(A.298) g G G 

(A.299) Wrh"™* = fnivl), \\r l ni \\ M ^ = 
(A.300) ||7f = • • • , ||r n 2 2 || M ^ = 

(A.301) et' G mxZpers({e G PERIODS | \\fa\\MW,st,e,PTSj, = r |) 

(A. 302) ||0 2 || M ( s *). s *./i3(^)./-D( e ' uai ( si : A .9 d!, ))nei',g _ ji 



I first prove the forward direction of clause 3. I assume (v{, . . . , v^jvf, . . . , i^ 2 ; vt) G 
eval(st,T l ,g db ). I need to prove that for some g and et', (|A.298|) - (|A.302[) hold. The 
assumption that . . . , v\ , v 2 , . . . , t> 2 2 ; vt) G eval{st,Y*,g db ) and ( A.297] ) imply that 



for some vj: 



(A.303) 
(A.304) 

(A.305) 
(A.306) 



{v\,...y ni -v\) G eval{st^g db ) 
there is no (v\, . . . , v\ ; v\ ) G eval(st, T,\,g d 

such that Jd{vI) □ foiv}), 
. . . , <; u t ) G eualCai, E 2 , (/ 6 );\ ) ... j<;t; i ) ) 
if i G {1, 2, 3, ... , m}, j G {1, 2, 3, ... , n 2 }, t},t] G 

and = Tj , then = Vj 
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.l||M(st),gi 

\M(st),st,fn(v\),PTS,gi 



m I! ■" = /dOi 1 ), |r ni 



1 ||M(st),gi 



( |A.303[ ) and clause 3 1 imply that for some g±: 

(A.307) 3i G G 

(A.308) 
(A.309) 

( |A.303[) and clause 2 1 imply that: 

(A. 310) v\,...,v l ni e D and u£ G D P 

(|A.305D , flOlCp , and clause 3 2 imply that for some g2- 



/d«) 



T 



(A.311) 
(A.312) 

(A.313) 



92 € G 

||_2||M(st), ff2 



/d(vi), ||r, 



|M(st),ff2 



2 i 
ri2 / 



M(st),8t,f D (vt),fD(eval(st,X',(g db )''\ i i i,)),<W 



I define the mapping g : VARS \— ► OBJS as follows: 



3(/?) 



f 31 if for some i G {1,2,3, ... ,ni}, /3 = r/ 
32 O 9 ), if for some j G {1, 2, 3, ... , n 2 }, = r? 
o, otherwise 



where o is a particular element of OBJS, chosen arbitrarily. ( A. 298 ) follows from 
lemma |AJ| the definition of g, ( |A.306|) , ( |A.308|) , and ( |A.312j ). I set et' as in ( [A.314Q , 
and show that (|A.29S| ) - (|A.302j ) hold. 



(A.314) 



The definition of 5 implies that for every variable (3 among t\, . . . , r^, g(/3) = 3i(/3). 



The assumption that r <; 



r 1 ' 

1 1 nil 



and the definition of r 0i n imply that all the 



variables of 4>\ are among rf, . . . , r„ . Hence, for every variable /3 in </>i, g(/3) = 31 (/3). 
Similarly, the definition of <? and the assumption that r 02 n = (tj, . . . , ) imply that 
for every variable ^ in 02, 3(/9) = 92(P)- 



( A. 299 ) follows from lemma [AT3] , the assumption that (r^, . . . , ) = r 0i n , ( A.308| ), 
and the fact that 5 and g\ assign the same values to all the variables of <j>\. The proof 
of ( IQocI) is very similar. 



I no w prove (|A.302|) . (|A~28^ ) and (A . 288) (w hi ch hold for v\, . . . , G D and v\ G -Dp) 
and (|A.310|) imply (|A.315|) . (|A.313|) , (|A.315|) , (|A.314|) , and the fact that g and # 2 assign 
the same values to all the variables of 02 (see above), imply ( A.302[) . 



(A.315) f D (eval(st,X\(g db )^ iy ^^^ 

It remains to show flOoip . (|A~30l ), clause 2 1 , and flOSp imply (|A.316|) . 
(A.316) et' G PERIODS 

(|A.316|) , ( |A.314| ), (|A.309|) , and the fact that 5 and 52 assign the same values to all the 
variables of <f>\ imply ( A.317| ). 

\\M(st),st,e,PTS,g 



(A.317) 



et' G {e G PERIODS 



T} 
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To prove ( |A.301| ) it remains to prove that there is no et" that satisfies both (A. 318) 
and (A.319). 



(A.318) 
(A.319) 



et" £{e£ PERIODS | ||<£i 
et' c et" 



M(st),st,e,PTS,g 



T} 



Let us assume that for some et" , (A.318) and (A.319) hold. (A.318) and the fact that 
g and g\ assign the same values to the variables of 4>\ imply that: 



(A.320) 
(A.321) 



u^fuM(st),st,el/',PTS,gi 
et" G PERIODS 



I S et v}' = f^iet"), which implie s (A.32 2). flO07| ), ( [A308|) , (A.322), ([A320D , and 
clause 3 1 imply (A.323). (A.319), ( |A3lD , and (A.322) imply (A.324). 



(A.322) 
(A.323) 
(A.324) 



et" 



f D (vj') 



(vi, . . .,v n ;vl) G eval(st,T ll ,g db ) 
Uvl) c f D {v]') 



(A.323) and (A.324) are against (A. 304). Therefore, there can be no et" that satisfies 
(A.318) and (A.319). ( |A.301| ) and the forward direction of clause 3 have been proven. 



I now prove the backwards direction of clause 3. I assume that for some g and et' , 
QA.298D - ( |A.302|) ho ld. I must show that (v{, . . . , , v{, . . . , v ^; v t ) € ev al(st,T ,, g dh ). 
According to (IA.297D , it is enough to prove that for some v\, ( |A.303| ) - ( |A.306| ) hold. 



I first prove ( |A.303| ). I set vj = f D 1 {et') 1 which implies (A. 325). ( |A.301 ) implies 
(A.326). Clause 3 1 , ( |A.298Q , flOSap , (A.325), and (A.326) imply flP55p . 



(A.325) 
(A.326) 



et' = f D (v l t ) 

^ uM{st),st,et' ,PTS,g _ j, 



I now prove ( A.305| ). ( A.303| ) (proven above) and clause 2 1 (also proven above) imply 
that: 



(A.327) 



Vi,...,v n G D and v t G Dp 



QA.286D and (A .288) (which hold fo r v{,.. ., y^ £ D and v j- G -Dp) , (|A.327|), and 
(A.325) imply flO^ . Clause 3 2 , d057|) , (|^298|) , ( |O0CD , ( Qjl ), and ([Ool ) 
imply dOS5p . 



(A.328) 



f D (eval(st, A', (/ 6 ); i i )U i r .. !<i .„i > )) = f D (eval(st, X,g db )) f] et' 



I now prove ( A. 306 ). Let us assume that i G {1, 2, 3, . . . , ni}, j G {1, 2, 3, . . . , n^}, 
rl,rj G VARS, and r/ = r|. From (|A.299D we get (A.329), and from (1A.300Q we get 
(A.330). 



(A.329) 
(A.330) 



-l||Af(st),p 



_2nM(st),g 
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(A.329), (A.330), and the hypothesis that t\ = r? imply that f D (v}) = /d(«|)- This 
in turn implies that /d 1 (/d(u*)) = fp (/d(u?)) ; i-e. that vj = Vj. ( A.306| ) has been 
proven. 



It remains to prove ( [A.304|) . Let us assume that there is a tuple (v{, . 
that (A.331) and (A.332) hold. 

(A.331) (v{,...,vi;vl')eeval(st,Z 1 ,g db ) 
(A.332) f D (v\) C f D {v l t ') 

(A.331) and clause 3 1 imply that for some cf: 
(A.333) g' G G 

(A.334) Unir^^' = f D (v{), \\r n \\ M{stW = M<) 



(A.335) 



l)i i\M(st),st,f D (vl'),PTS,g' 



Lemma [OJ QA.29SQ , (|A.333|) , the assumptions that fa G YNFORMS and r fa' 1 = 
(rl, ■■■,Tn 1 ), ( |A.299| ), and ( |A.334| ) imply that g and g' assign the same values to the 
variables of fa. This implies (A.336). (|A.335|) and (A.336) imply (A.337). 



\M{st),st,f D {vl'),PTS,g' 



\M{ S t),st,f D {v\'),PTS,g 



\M{st),st,f D {v\'),PTS,g _ rp 



(A.336) 

(A.337) ||0i | 

I set et" as in (A.338). (A.332), (A.325), and (A.338) imply (A.339). (A.337) and 
(A.338) imply (A.340). 

(A.338) et" = f D (vj') 

(A.339) et' C et" 



\M(si),st,et",PTS,g 



T 



(A.340) 

(A.339) and (A.340) are against ( |A.301| ), because (A. 301) and the definition of mxlpers 



M(st),st,et",PTS,g 



(section |3.3j) imply that there is no et" , such that et' □ et" and 
T. Therefore, there can be no tuple (v\, . . . ;v\ ) such that (A.331) and (A.332) 
hold. ( |A.304j ) and the backwards direction of clause 3 have been proven. 



A. 3.22 Before[fa,fa) 
Translation rule 



If 4>i,4>2 6 YNFORMS and A is a Tsql2 value expression, then: 

def 

trans(Before[fa, fa], A) = 

(SELECT DISTINCT ai . 1 , «i.2, ... , a\.ri\ a 2 ■ 1 , a 2 .2, ... , a 2 .n 2 
VALID VALID (a 2 ) 

FROM trans(fa, Xinit) (NDSUBPERIDD) AS a x , 

trans(fa, A') AS a 2 ) 
WHERE . . . 
AND . . . 



AND . . . ) 
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Xinit is as in section |5.10| , A' is the expression INTERSECT (PERIOD (TIMESTAMP 'beginning' , 
BEGIN (VALID (ai)) - INTERVAL '1' x) • A) , and x is the Tsql2 name of the granularity 
of chronons. m and n 2 are the lengths of r (pi~ l and r 4>2~ 1 respectively. Each time the 
translation rule is used, a\ and 0:2 are two new different correlation names, obtained 
by calling the correlation names generator after A has been supplied. a\ is generated 
after trans(<pi, Xinit) has been computed, and before computing trans(<p2, A'). Assum- 
ing that r 0i n = (r-j 1 , . . . , t* ) and r ^ 2 n = (rf, . . . , ), the "... "s in the WHERE clause 



are all the strings in the set S of section A. 3.21 



Proof that theorem 5/2 holds for <p = Before[4>i, fo], if it holds for 



and 



The proof is very similar to that of section |A.3.21 



A. 3.23 After^fa] 
Translation rule 

If 4>i,(p2 S YNFORMS and A is a Tsql2 value expression, then: 

def 

trans(After[<pi,<p2], X) = 

(SELECT DISTINCT ai-1, a x .2, ... , a x .n x a 2 . 1 , a 2 .2, ... , a 2 .n 2 
VALID VALID (a 2 ) 

FROM trans{cj)\, Xi n a) (NOSUBPERIOD) AS a\ , 

trans(4>2, X') AS a 2 ) 
WHERE . . . 
AND . . . 



AND 



) 



ni,ri2 are the lengths of r 4>\~ 1 and r 4>2~^ respectively. Xi n n is as in section 
A' is the expression INTERSECT (PERIOD (END (VALID (ai) ) + INTERVAL >1> \> TIMESTAMP 
'forever') , A), and \ ls the TSQL2 name of the granularity of chronons. Each time 
the translation rule is used, a\ and 02 are two new different correlation names, obtained 
by calling the correlation names generator after A has been supplied. a\ is generated 
after trans((f>i, Xi n u) has been computed, and before computing trans(4>2, A'). Assum- 
ing that r 4>i~ l = (tI, . . . , t4 ) and r 4>2~ 1 = (rf , . . . , ), the "... "s in the WHERE clause 



are all the strings in the set S of section A. 3. 21 



Proof that theorem 5.2 holds for 



After [ 



if it holds for 



and 



The proof is very similar to that of section 



A3. 21 
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A. 4 Translation rules for wh-formulae and proof of theo- 
rem |b.l| 



This section lists the translation rules for wh-formulae. These rules have to satisfy the- 
orem [3j] (see section 5.10 ), There are two translation rules for wh-formulae. They cor- 
respond to the cases where the Top formula <j) to be translated belongs to WHFORMSi 
or WHFORMS 2 
holds if 



(see section ^2 
G WHFORMSi or 



g 



. Each rule is followed by a proof that theorem p.l 
WHFORMS2 respectively. 



As explained in section 5.11, the translation rules for wh-formulae define trans((j), A) 
only for A = Xmit- The values of trans(cfi, A) for <f> G WHFORMS and A 7^ \i n u are 
not used anywhere (they are also not examined by theorem 5.1) and can be chosen 
arbitrarily. 



A.4.1 ?ft ?/? 2 ?/%... ?/? n 0' 
Translation rule 



If ft, ft, . . . , ft G VARS, cj)' G YNFORMS, and X init is as in section pTToj , then: 
trans{Wi ?ft ?/%... ?ft A init ) 

(SELECT DISTINCT SNAPSHOT a.^i , a.w 2 , •■■ , a.w„ 

FROM trans((j)' , Ximt) AS a) 

Each time the translation rule is used, a is a new correlation name, obtained by calling 
the correlation names generator. For every i G {1, 2, 3, . . . , n}, 



min({j | j G {1,2,3, ... , k} and Tj = Pi}) 



where (ti,T2,T3, . . . ,Tk) = r (f) n . That is, the first position (from left to right) where 
Pi appears in (n, . . . , r^) is the Wj-th one. 



Proof that theorem 5.1 holds for <j> =?ft ?ft ?ft ... ?ft 



I assume that ft, ft, • • • , ft G K4i?5 and </>' G YNFORMS. By the syntax of Top, 
this implies that ?ft ?ft ?ft ... ?ft 0' G WHFORMS. I also assume that si G PTS, 
Xinit is as in section |5.10| , £ = trans{iP\ ?ft ?ft ••• ?ft <P' ■, ^init) , and that r n = 
(tl, . . . , Tfe) (as in the translation rule). I need to show that: 

1. FCN(£) = 

2. eval(st,T,) G SREL(n) 

3. {(/d(ui), • • • ,/o(un)) I • • • ,Wti> G euaZ(si,£)} = 
||?ft ?ft?ft ...?/?„ ^H M(st),flt 
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According to the syntax of Tsql2, \i n it is a value expression. Let g db be an arbitrary 
member of G db . By lemma fy], (A.341) - (A.343) hold. 

(A.341) f D (eval(st,X mit ,g db )) = PTS 

(A.342) eval(st,X mit ,g db ) £ D* P 

(A.343) FCN{\ mit ) = 

Let E' be the SELECT statement in the FROM clause of £, i.e. E' = trans((f>' , \init)- 
From the hypothesis and the discussion above, <ft G YNFORMS , st G PTS, r (j)'~ l = 
(ti, . . . ,Tfc), g db G G dfe , Aj n j( is a Tsql2 value expression, and eval(st, Xi n it, g db ) G -Dp. 
Then, from theorem |5.2| (already proven), and using (A.341) and (A.343), we get: 



1'. FCTV(E') = 

2'. euaZ(si,S / ) G VREL P (k) 

3'. • • • ,v' k ;v' t ) G eval(st,T,') iff for some g £ G: 

|| Tl ||M( s t), 5 = . . . , ||r fc || M ^)'9 = /z?K), and ||^||W(rt) ) rt,/ D (t4) ) PrS )fl = T 

In clauses 2' and 3' I have not included g db among the arguments of eval(st, £'), because 



according to clause 1', FCN(T,') = (see comments in section 5.4). 



Proof of clause 1 



The a.ux, . . . , a.uj n in the SELECT clause of £ are not free column references in X, 
because E is a binding context for all of them. E contains no other column references 
(and hence no other free column references), apart from those that possibly appear 
within £'. By lemma A.l, this implies (A. 344). Clause 1' and (A. 344) imply clause 1. 



(A.344) 



FCN(E) C FCTV(E') 



Proof of clause 2 

When computing eval(st, E), the a of E ranges over the tuples of eval{st,Y>'). (I 
do not include in the arguments of eval(st, E) and eval{st,H') the assignment to the 
correlation names, because according to clauses 1 and 1', FCN(Ti) = FCN(T,') = 0.) 
By clause 2', eval(st,T,') G VRELp(k). Hence, a ranges over tuples of the form 
(v[, . . . ,v' k ;v' t ) G eval(st, E'). 

The syntax of Top (section |]| and the fact that ?/3i ... ?/3 n 0' G WHFORMS imply 
that for every i G {1,2,3,... ,n}, occurs at least once within eft'. This and the 
definition of r . . . n imply that /3j occurs at least once within r </> n , which according to 
the hypothesis is (t\, . . . ,r k ) . Hence, for every i G {1, 2, 3, . . . , n}, the set {j \ j G 
{1, 2, 3, . . . , k} and Tj = Pi} in the definition of uji (see the translation rule) is not 
empty, and u>j G {1, 2, 3, ... , k}. Assuming that a refers to a tuple (v[, . . . , v' k ; v' t ) G 
eval(st, £'), for every i G {1, 2, 3, ... , n}, the a.tOi of E (see the translation rule) refers 
to v'^., i.e. the u;j-th (from left to right) among v[, v' 2 , ■ ■ ■ , v' k . 
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It should now be easy to see from the translation rule that: 

(A.345)emZ(^,£) = . . . | 

(i^, ^2, i>3, • • • ,v' k ;v t ) G eval(st, £ ), and 
for every i G {1, 2, 3, ... , n}, 

= min({j | j G {1,2,3, ... , fc} and Tj = A}) } 



( A. 345 ) implies that eval(st, S) is a snapshot relation of n attributes. Clause 2 has 
been proven. 



Proof of clause 3 

By the definitions of ||0|| M ( S *)> S * and [|0||W(«t),«t,rf,K^ for e WHFORMSi (section 
clause 3 becomes: 



(A.346) {{fD(vi),f D (v 2 ),--- ,fD(v n )) | (^l) ^2) • • • ; u n) G eval(st, £)} 

U {(#),S(A), ••,5(/?n)) I ||0'||^)>*wrc, fl=r} 



I prove (|Q4l ) by proving that its left-hand side (LHS) is a subset of its right-hand 



side (RHS), and that its RHS is a subset of its LHS. I start with the proof that 
the LHS of ( |A.346| ) is a subset of the RHS. I assume that for some vx,...,v n G D, 



(/d(ui), . . . , fD(v n )) is an element of the LHS of (|A.346|) , i.e. that ([A.347|) holds 



(A. 347) (vi, . . . , v n ) G eval(st, E) 

I need to prove that . . . , fD{v n )) is an element of the RHS of QA.346D , i.e. that 

for some g and et: 

(A.348) g G G 

(A. 349) et G PERIODS 

(A.350) f D ( Vl ) = g{fix% . . . , fD(v n ) = g{Pn) 

(A.351) ||^||M( S t), S t,ei,PTS,s = T 



( A.347 ) and ( A.345| ) imply that for some v[, . . . , v' k , v' t : 



(A.352) (v[, . . . , v' k ;v' t ) G eval(st, £') 

(A.353) Vl = v ' V2 = v ' ... , Vn = v ; 



(A.354) for every i G {1,2,3, ... ,n}, 

u)i = min({j | j G {1,2,3, ... , k} and tj = A}) 



( A.352 ) and clause 3' imply that for some g: 



(A.355) g G G 

(A.356) ||ti|| M ^ = f D (v[), \\r k \\ M ^ = f D ( v ' k ) 

(A.357) ||0/||Af(st),at,/ Z 3K). i ' T5 .fl = T 
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(A. 348) is the same as (|A.355| ), which is known to be true. I set et as in (A. 358). Then, 
fejjjjj) follows from (|A.357| ). (]A7352|) and clause 2' imply (A.359). (|AT349|) follows 
from (A.358) and (A.359). 



(A.358) 
(A.359) 



et = fniv't) 
v' t E Dp 



It remains to prove ( A.350|) . For every i G {1,2,3,... , n}, since fa E VARS, according 
to the semantics of Top, (A.360) holds. flOjjg ) implies (A.361) and (A.362). 



(A.360) 
(A.361) 
(A.362) 



<?(A) = IIAII M(st) ' 9 

7^ = fa 

Wi E {1,2,3,... , A;} 



(A.361) and (A.360) imply (A.363). (A.362) and flQgjj ) imply (A.364). (A.363) and 
(A.364) imply (A.365). 



(A.363) 
(A.364) 
(A.365) 



9{fa) 



\M(st),g 

gWi) = fo{v') 



\M(st),g 



( |A.353 ) and the assumption that i E {1, 2, 3, . . . , n}, imply (A. 366). (A. 366) and 
(A.365) imply (A.367). 



(A.366) 
(A.367) 



g(fa 



= Vi 

fo{vi) 



I have proven that for every i E {1,2,3,... , n}, /d(^«) = 9 (Pi)- Therefore, ( A.35C| ) 
holds. The proof that the LHS of (^34|) is a subset of the RHS has been completed. 



I now prove that the RHS of (A. 346) is a subset of the LHS. I assume that for some 
g E G, (g(Pi), g(P2)i ■ ■ ■ ,g(Pn)} is an element of the RHS of ( [A, 346 ), i.e. that for some 
g and et, ( |A.368| ) - flO?C| ) hold. 



(A.368) 
(A.369) 
(A.370) 

I need to prove that (g(Pi), g(P2) 
that for some v±, . . . , v n E D: 



g^G 

et E PERIODS 

\\ii\\M(st),st,et,PTS,g 



,g(Pn)) is an element of the LHS of (A.346), i.e 



(A.371) 
(A.372) 



g(Pl) = fD(vi), • • • , g(Pn) = fD(Vn) 

(«!,..., v n ) E eval(st, S) 



I set vx, . . . , v n as in ( A.373[ ), which implies that v\, . . . , v n E D and that ( A.371 ) holds. 

(A.373) Wl = fE\g(Pl)), ...,V n = f D 1 (g(Pn)) 
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It remains to prove ( |A.372| ). By ( A.345| ), it is enough to prove that for some v[, v' 2 , 
... , v' k , v' t , ux, u 2 , ... , u n : 

(A.374) {v[,...,v' k ;v' t ) £ eval(st,E') 

(A.375) i>i=i/ Ul , v n = v' u)n 

(A. 376) for every i £ {1, 2, 3, ... , n}, 

u)i = min({j | j £ {1,2,3, ... , k} and Tj = ft}) 

I set v[, v' 2 , ■ ■ ■ , v' k as in (A. 377), which implies (A. 378). I also set v' t = f^ 1 {et), which 
implies (A.379). Clause 3', (j068|) , (A.378), (A.379), and ( |A~370|) imply (|A~37l ). 



(A.377) v[ = fBWlnt^' 9 ), ■ ■ ■ , v' k = fn\\\Tk\\ M{st) > a ) 

(A.378) ||ti|| m ^ = f D (v[), \\n\\ M ^ = f D ( v ' k ) 

(A.379) et = f D (v' t ) 



For every i £ {1, 2, 3, . . . , n}, I set u>i as in (|A.376j ). It remains to prove ( A.375| ). 



([A.373D implies that for every i £ {1, 2, 3, . . . , n}, (A.380) holds. Since ft £ VARS, 
from the semantics of Top we get (A.381). (A.380) and (A.381) imply (A.382). 

(A.380) Vi = fEHgifo)) 

(A.381) 9{l3i) = m\\ M{stU 

(A.382) Vi = fB X m\\ M{at) ' a ) 

( |A.376D (which holds; see above) and the assumption that i £ {1, 2, 3, . . . , n} imply 
that: 

(A.383) ft = 

(A.384) uji £ {1,2,3, ... ,k} 

Using (A.383), (A.382) becomes (A.385). (A.384) and (A.377) imply (A.386). 
(A.385) m = fBHlK^^) 

(A.386) <=/^(ll^|| M(st) ' 9 ) 

(A.385) and (A.386) imply that Vi = v' UJi . I have proven that for every i £ {1, 2, 3, . . . , n}, 
Hence, ( A.375| ) holds. The proof that the RHS of ( |A.346|) is a subset of the 



Vi = v„ 



LHS has been completed. 

A.4.2 l mxl ^ ?ft ?/%... ?/?„ ft 
Translation rule 



If ft , ft>, ■ ■ ■ , @n £ VARS, <j)' £ YNFORMS, and \ mit is as in section |BTlO| , then: 

trans(? mx iPi ?ft> ?ftj • • • ?fti <t>', \nit) = 

(SELECT DISTINCT SNAPSHOT VALID (a 2 ) , a 2 .2, a 2 .3, ... , a 2 .n 
FROM (SELECT DISTINCT 'dummy', ai.W 2 , ai.w 3 , ... , a\.uj n 
VALID Qfi.Wi 

FROM trans(4>' ,\ imt ) AS ai 
) (NOSUBPERIOD) AS a 2 ) 
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Each time the translation rule is used, ol\ and a 2 are two different new correla- 
tion names, obtained by calling the correlation names generator. For every i G 
{1,2,3, ... ,n}, 

Ui = min({j | j G {1,2,3, ... , k} and Tj = Pi}) 

where (t\,T2,T3, . . . ,Tk) = r (j)'~ l . That is, the first position (from left to right) where 
Pi appears in (ti, . . . , r^) is the Wj-th one. 



Proof that theorem 5.1 holds for (j) =? mx iP± ?/3 2 ^Pz ■ ■ ■ ^Pn <f>' 



I assume that Pi, p 2 , . . . , P n € VARS and <f>' G YNFORMS. By the syntax of Top, this 
implies that ? m xlPl ?/% ■ ■ ■ ^Pn 4>' G WHFORMS. I also assume that st G PTS, 
Xinit is as in section |5.10| , E = trans(? mx iPi ?/5 2 ?/% ••• ?Ai ft, Knit), and that 



(ri, . . . , Tk) (as in the translation rule). I need to show that: 



1. FCN(Y) = 

2. eval(st,T,) G SREL(n) 

3. {(/d(-ui), • • • , /d(-u„)) I G eua£(si,£)} = 

V-mxlPl IP* 103 ...Wn<P'\\ M{st) ' St 

Let £' be the embedded SELECT statement to which «i refers, i.e. E' = trans((j)', X 



inif / 



Following exactly the same steps as in section A.4.1, we arrive at the conclusion that 



1'. FCTV(E') = 

2'. ewtZ^E') G VREL P (k) 

3'. v^; t^) G eval(st, £') iff for some g £ G: 

|| Tl ||M( s t), 5 = / D ( w ' 1 ) ) . . . ; ||ri.|| M ( s *)'S = /dK), and ||0'||M( s t), s t,/ D K),Pr5, g = T 



Proof of clause 1 



The VALID (a 2 ) , «2 • 2, . . . , a 2 • n in the SELECT clause of E are not free column references 
in £, because £ is a binding context for all of them. The a\.ui, . . . , ai.cj n in the VALID 
and the SELECT clauses of the embedded SELECT statement to which a 2 refers are also 
not free column references in E, because the SELECT statement to which a 2 refers is 
a binding context for all of them. E contains no other column references (and hence 
no other free column references), apart from those that possibly appear within £' (the 
trans(4>' , Xi n it)). By lemma A.l, this implies ( A.387| ). Clause 1' and (A. 387) imply 
clause 1. 



(A.387) 



FCA(S) C FCN(r!) 
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Proof of clause 2 



When computing eval(st, £), the aq of £ ranges over the tuples of eval(st, £'). (I 
do not include in the arguments of eval(st, S) and eval(st,H') the assignment to the 
correlation names, because according to clauses 1 and 1', FCN(T,) = FCN(T,') = 0.) 
By clause 2', eval(st,T,') G VRELp(k). Hence, aq ranges over tuples of the form 
(v' 1 ,...,v' k ;v' t ) G eval{st,Y>'). 

The syntax of Top (section and the fact that ? mxl Pi . . . lp n $ G WHFORMS 
imply that for every i G {1, 2, 3, . . . , n}, Pi occurs at least once within <p' . This and 
the definition of r . . . n imply that Pi occurs at least once within r (f)'~ > , which according 
to the hypothesis is (ri, . . . , r^). Hence, for every i 6 {1,2,3,..., n}, the set {j \ j G 
{1, 2, 3, . . . , k} and Tj = Pi} in the definition of uji (see the translation rule) is not 
empty, and u>i G {1, 2, 3, ... , k}. Assuming that aq refers to a tuple (v[, . . . , v' k ; v' t ) G 
eval(st, £'), for every i G {1, 2, 3, ... , n}, the ac\.u)i of £ (see the translation rule) refers 
to v'^., i.e. the Wj-th (from left to right) among v[, v' 2 , ■ ■ ■ , v' k . 

The reader should now be able to see that the embedded SELECT statement of the 
translation rule returns the following relation, where u\, . . . , u n are as in the translation 
rule. (I assume that the TSQL2 string 'dummy' evaluates to the element of D dummy. )0 

(A. 388) eval(st, SELECT DISTINCT 'dummy', oli.uj 2 , ai.u 3 , ... , ai.u n 
VALID ai.wi 

FROM trans(4>' , X tmt ) AS «i) 
= {{dummy, v' UJ2 ,v' UJ;i1 ...,v' UJn \v'u 1 ) \ (v[, v' 2 , v' 3 , . . . , v' k ; v' t ) G eval(st, £')} 



Let us use r(st) to refer to the relation of (A.388j), It should be easy to see from the 



translation rule and the semantics of (NDSUBPERIDD) (section |5.3.2j ) that: 
(A.389) 

eval(st,E) = {{v t ,v 2 ,v 3 , ...,v n ) \ 

{vi, t>2, t>3, • • • , v n ;vt) G nosubperiod(r(st))} 



Using the definition of nosubperiod (section |5.3.2j ) , ( A,389| ) becomes ( |A390| ). 



(A.390) eval(st,T,) = {(v t , v 2 , v 3 , ... ,v n ) \ (v 1} v 2 , v 3 , v n \ v t ) G r(st), 

and if (vi,V2,v 3 , . . .,v n ;u t ) G r(st), 
then f D (v t ) t- fDiut)} 



Replacing r(st) by the relation of (|A.388|) , ( |A.390| ) becomes (|A.391|) . (|A.391| ) is equiv- 



3 The reader may wonder if v' ull is always a valid time-stamp, i.e. if there is any guarantee that 
v' wi 6 Dt- Intuitively, this is indeed the case, because v' Ul corresponds to /3i, and the syntax of 
Top requires /3i to occur at least once within <j>' as the first argument of a Past, Perf, At, Before, 
After, or Ntense operator. The semantics of Top requires variables that occur at these positions to 
denote periods, and hence v' ui will also denote a period, i.e. v' u £ Dp C Dt- This, however, will 
not be proven formally here. 
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alent to 
(A.391) 



(A.392) 



eval(st,T.) = {(v t ,v 2 ,v 3 , . . . ,v n ) 
vi = dummy, v 2 



V 3 



J L0l 1 



{v'i,v' 2 ,v' 3 , . . .,v' k ;v' t ) E eval(st,H'), 
and if v\ = dummy, v 2 = v" 2 , v 3 = 

and «, v'l v'l, v'l; v'() E eval(st, S'), then f D (v t ) £ f D (u t )} 
eval(st,Z) = {{v'^v'^v^,... ,v' Un ) \ 
{v'i,v' 2 ,v' 3 , . . .,v' k ;v' t ) E eval(st,H'), 
and if (t^ , v 2 , v 3 , . . . , v k ; v' t '} E eval(st, £ ) and 



v" v' 



2 holds. 



C s : v L n = v 'L n , then /d(«wJ t /oKj} 
implies that eval(st, S) is a snapshot relation of n attributes. Therefore, clause 



Proof of clause 3 

By the definitions of \\<j)\\ M ^' st and \mM(st),st,et,lt,g f or ^ G WHFORMS 2 (section 



3.6), clause 3 becomes: 



(A.393) {{fD(vi),fD(v2),f D (v 3 ),...,f D (v n )) \ {v 1 ,v 2 ,v 3 , . . . ,v n ) E euaZ(si, £)} 

U {<s(/3l),5(/3 2 ),...,2(/?n)> | \\ (j) '\\mst),st,et,PTS, 9 = T ^ 

g£G,et£PERIODS 

and for no et' E PERIODS and g' € G is it true that 

9(^2) = J(Jh), 9(Pn) = g'(Pn)} 

I prove ( |A.393| ) by proving that its left-hand side (LHS) is a subset of its right-hand 
side (RHS), and that its RHS is a subset of its LHS. I first prove that the LHS is a 
subset of the RHS. I assume that for some v\, . . . , v n E D, (/d(i>i), . . . , /r)(f n )) is an 
element of the LHS of ( |A.393| ), i.e. that ( |A.394| ) holds. 



(A.394) (v\, . . . , v n ) E eval(st, E) 



I need to prove that (/d(^i), • • • , fD(v n )) is also an element of the RHS of ( A.393|) , i.e. 
that for some g and et: 

(A.395) g£G 

(A.396) et E PERIODS 

(A.397) f D ( Vl ) = gfa), f D (v n ) = g((3 n ) 

(A.398) y\\M(st) M PTS,g = T 

(A.399) for no et' E PERIODS and g' E G is it true that 

m M(st),st,et',PTS, 9 > = Tj g{f3i) |— and 
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( |A.394p and ( |A.392| ) imply that for some v' x , . . . , v' k , v' t : 

(A.400) (v' 1 ,...,v' k ;v' t ) G eval(st,H') 
(A.401) ui = v 2 = v' u)2 , v n 
(A.402) if (v", v'z, v'i, v'l; v") G eval(st, £') and 

< = C <, = C *C then / D «) £ / D «) 

where u>i, . . . , oj n are as in the translation rule. 
([A.400[) and clause 3' imply that for some g: 

(A.403) geG 

(A.404) ||ri|| M ^ = f D (v[), \\r k \\ M ^ = f D { v ' k ) 

(A.405) ^'\\M{st),stJ D {v' t ),PTS,g _ rp 

(A. 395) is the same as (|A.403| ), which is known to be true. I set et as in 
(|A.398|) follows from JA.405Q . 

(A.406) et = f D (y' t ) 

( [A.400[ ) and clause 2' imply that v' t G D P , which in turn implies (|A.407| ). (|A.396| ) 
follows from (|A.406Q and (|A.407|) . 

(A.407) f D (v' t ) G PERIODS 

I now prove ( |A.397j ), For every i G {1, 2, 3, ... , n}, since Pi G VARS, according to the 
semantics of Top: 

(A.408) g(Pi) = \W\ M{st) ' g 

The definition of u\, . . . , u n in the translation rule implies that : 

(A.409) r Ui = Pi 

(A.410) Ui G {1,2,3,..., k} 

(|A.409| ) and (|A.408| ) imply (A.411). (|A.4iq) and (|A.404| ) imply (A.412). (A.411) and 
(A.412) imply (A.413). 

(A.4H) g(Pi) = \K i \\ M{st ^ 9 

(A-412) \KM M{st) ' 9 = f D (vU) 

(A.413) 9(Pi) = fDMn) 

(|A.401|) and the assumption that « G {1, 2, 3, . . . , n}, imply (A.414). (A.414) and 
(A.413) imply (A.415). 

(A.414) = «i 

(A.415) gift) = f D { Vi ) 

I have proven that for every i G {1,2,3,... ,n}, /o(uj) = g(Pi). Therefore, ( A.397| ) 
holds. 
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It remains to prove ( |A.399 ). Let us assume that for some et' £ PERIODS and g' G G, 
(A.416) and (A.417) hold. 



(A.416) 
(A.417) 



\\^t\\M{st),at,et',PTS,g' _ rp 



For j £ {1, 2, 3, ... , k}, I set v] = fE 1 (hj\\ M( - st) ' 9 '), which implies (A.418). I also set 
v'l = f~ l (et'), which implies (A. 419). 



(A.418) 
(A.419) 



\M(st),g' 



/dK). ■■■) Wnl 

et' = f D {4) 



\M(st),g' 



(A.416) and (A.419) imply (A.420). Clause 3', the assumption that g' £ G, (A.418), 
and (A.420) imply (A.421). 



(A.420) 
(A.421) 



^i^M(st),stJ D (v' t '),PTS,g' = rp 

\Vi,v 2 ,v 3 ,... ,v k ;v t ) G eval(st,iZ ) 



The definition of uj\, . . . ,u n in the translation rules implies that u% £ {1, 2,3, ... ,k} and 
r Wi = 1 , Then, from (A.418) we get (A.422). (A.422) and the fact that ft G VftiZS 
imply (A.423). 



(A.422) 
(A.423) 



|M(st), a ' 



/d(v) = IK* II = HP* 

M<) = <7'(ft) 



\M(st),g' 



(|A.397| ) (proven above) and ( |A.401|) imply (A.424). (A.423), (A.424), and (A.417) 
imply (A.425). 



(A.424) 
(A.425) 



/d(v) c /d(^; 



For I £ {2,3, ... ,n}, the definition of u>i, . . . , u n in the translation rule implies that: 



(A.426) 
(A.427) 



w, £ {1,2,3,...,*;} 



Pi 



(A.418) and (A.426) imply (A.428). (A.428), (A.427), and the fact that £ VARS 
imply (A. 429), which in turn implies (A. 430). 



(A.428) 
(A.429) 
(A.430) 



•u>i I 



\ M(st) ' 9 ' = /D«) 

M\ M{st) ' 9 '=g'(f3i) = fD(v") 



v"=f D Hg'(Pi)) 



u ui - J D 

Since I G {2,3,.. . ,n}, (A.417) implies (A.431). (A.430) and (A.431) imply (A.432). 



(A.431) 
(A.432) 



g'(Pi) = g(M 
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( A.397| ) (proven above) and the fact that / G {2,3, ... ,n} imply that g(fii) = /d(^)> 
which in turn implies (A.433). (A.432) and (A.433) imply (A.434). (A.434), flATIm] ), 
and the fact that / G {2, 3, . . . , n} imply (A. 435). 



(A.433) 
(A.434) 
(A.435) 

(A.434) and (A.435) imply that 
Hence, flOHj ) holds. 



/ D 1 ( 5 (A)) = ^ 

< = "J 



I have proven that for every I G {2, 3, 4, . . . , ra}, 



(A.436) 



v' v" 



(A.421), (|A.436|) , and (A.425) are against ( |A.402j) . Therefore, the hypothesis that 
there is an et' G PERIODS and a, g' £ G, such that (A.416) and (A. 417) are satisfied 
cannot hold. ( A.399 ) has been proven. The proof that the LHS of flQgp is a subset 
of the RHS has been completed. 



I now prove that the RHS of (A. 393) is a subset of the LHS. I assume that for some 
g G G, (g(Pi), 5(^2), • • • ,g(Pn)) is an element of the RHS of ( |A.393 ), i.e. that for some 
g and et, ([Q37D - flQ4C| ) hold. 



(A.437) 
(A.438) 
(A.439) 
(A.440) 



9£G 

et G PERIODS 

11 ^/nM (si), st,et,PTS,g _ j> 

for no et' G PERIODS and g' £ G is it true that 

<?(/%) = </(/%), g(Pn)=g'(Pn) 



I need to prove that {g((3\),g((3i), . . . ,g(f3 n )} is also an element of the LHS of ( A.393j ), 
i.e. that for some v\, . . . ,v n G D: 



(A.441) 
(A.442) 



g{Pl) = /D(«l), • • • , 5(/3n) = /oK) 

(«!,..., u n ) G eval(st, S) 



I set wi, . . . ,v n as in (|A.443| ), which implies ( |A.44l|) . 

(A.443) Vl = fB l (9(Pl)), ...,v n = fB l Wn)) 

It remains to prove ( |A.442| ). By ( |A.392j ), it is enough to prove that for some v[, v' 2 , 



> v 'k, v 't- 



(A.444) 
(A.445) 
(A.446) 



(v'i, ■ ■ -,v' k ;v' t ) G eval(st,T!) 

Vl=v' LU1 , V n = 

if (v'{, v%, . . . , ujf ; O G eval(st, £') and 



«1> u L 



» ^ 



t , then/ D «)^/ D «) 



where u>i, 



, uj n are as in the translation rule. 
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I set v[,...,v' k as in (A. 447), which implies (A. 448). I also set v' t = f D 1 {et)^ which 
implies (A.449). Clause 3', flP37l) , (A.448), (A.449), and flQgp imply ([044] ). 



(A.447) 
(A.448) 
(A.449) 



4 = /dHINI^), ...,^ = /^(||r fe ii^)^ 
| Ti ||M(^ = /dM ) 5 || Tfc ||M( s t), s = /DK) 

et = f D (v' t ) 



I now prove ( A.445| ). ( A.443| ) implies that (A. 450) holds for every i G {1,2,3, . . . , n}. 
Since fa G VARS, from the semantics of Top we also get (A. 451). (A. 450) and (A. 451) 
imply (A.452). 



(A.450) 
(A.451) 
(A.452) 



<KA) = m\\ M(st) ' 9 



f 



D 



1 rii/ : i-ii M ( s *)'3'i 



The definition of uj%, . . . , u) n in the translation rule implies that: 



(A.453) 
(A.454) 



fa = Tuj, 

G {1,2,3,... , k} 



Using (A.453), (A.452) becomes (A.455). (A.454) and (A.447) imply (A.456). 



(A.455) 
(A.456) 



\M(st),g\ 
\\M{st),g\ 



(A.455) and (A.456) imply that = v'^, . I have proven that for every i G {1, 2, 3, . . . , n}, 
Vi = v' w .. Hence, ([A.445Q holds. 



It remains to prove ( A.446| ). Let us assume that: 

(A.457) (v'{, vl vl v'l; v'{) G eval(st, E') 

(A.458) 



v' = v" v' 



v" v' 



We need to prove that /z^i^) [Z^ foW^)- Let us assume that this is not true, i.e. that 
(IA.459D holds. 



(A.459) /d«)c/d«) 
Clause 3' and (A.457) imply that for some g': 
(A.460) g G G 



\M(st),g' 



f D (v"), \\r k 



\M(st),g' 



(A.461) 

(A.462) \U'\\M(3t),st,f D (v' i '),PTS,g> = T 

I set et' as in (A.463). Then (A.462) implies (A.464). 



(A.463) 
(A.464) 



et' = f D {v'{) 

h /\\M(st),st,et',PTS,g' 
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(A. 457) and clause 2' imply that v' t ' G Dp, which in turn implies that foWl) G 
PERIODS. Then, (A.463) implies that: 

(A.465) et' G PERIODS 

For i G {1,2,3, ... ,n}, (A.461) and (A.454) imply (A.466). (A.466), (A.453), and the 
fact that /3i G VARS imply (A.467). 

(A.466) Ikjr^' = M<) 

(A.467) Hail^W = </(j 3i ) = f D (y» i ) 

(|A.441[) (which holds because of (|A.443[) ) and the fact that i G {1,2,3,... ,n} imply 
(A.468). ( |A.445| ) (proven above) and the fact that i G {1, 2, 3, . . . , n} imply (A.469). 
(A.468) and (A.469) imply (A.470). 

(A.468) gift) = f D ( Vi ) 

(A.469) Vl = v' Ui 

(A.470) </(&) = MO 

(A.470) and (A.467) were proven for i G {1,2,3, . . . , n}. For i = 1, from (A.470) we 
get (A.471), and from (A.467) we get (A.472). dQ5S|) , (A.471), and (A.472) imply 
(A.473). 

(A.471) 9(Pi)=fD{vU) 
(A.472) = M<) 

(A.473) 5 (/3i) C ^(A) 

For i G {2,3, ... ,n}, from (A.470) we get (A.474), and from (A.467) we get (A.475). 
(A.458) and the fact that I G {2, 3, . . . ,n} imply (A.475). (A.474) and (A.475) imply 
(A.476). 

(A.474) <7(A) = M<) 

(A.475) ff / (A) = /i?«K=< 

(A.476) ff(A) = /o«) 

(A.476) and (A.475) imply that g(/3/) = g'(Pi)- I have proven that for every I G 
{2, 3, ... , n}, 5 (A) = 5 '(A)- Therefore, flfcfeg) holds. 

(A.477) 5 (/3 2 ) = g'((3 2 ), g{fo) = g'((3 3 ), g{f3 n ) = g'{(3 n ) 

(|A.465|) , (A.460), (A.464), (A.473), and QA.477D are against (|A.440Q . Therefore, the 
hypothesis that ( A.459]) is true cannot hold, i.e. JdW^) ^ ). ( |A.446| ) has been 

proven. The proof that the RHS of (A. 393) is a subset of the LHS has been completed. 



